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ON THE FOUNDATIONS OF STATISTICAL THERMODYNAMICS* 
Ву Г. ROSENFELD, MANCHESTER 
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of high esteem and respectful friend- 
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The aim of this essay is to survey the logical structure of statistical thermodynamics 
and to realize how faithfully this theory reflects the peculiar dialectical character of the 
physical situation. The author tries to outline very briefly the foundations of the atomistic 
interpretation of the laws of thermodynamics, laying the emphasis on the physical mean- 
ing of the formal arguments. The points that he wants to make are no doubt familiar 
to those physicists who have cared to look into the matter; but there is still widespread 
confusion concerning some of them, which may afford some justification for taking up 
the question from a synthetic point of view. In this connexion, it is not at all superfluous 
to discuss the classical approach as well as the quantal one: the true significance of the 
latter can only be perceived by contrasting it with the former. г 


The atomistic interpretation of thermodynamics, on a statistical basis, was elab- 
orated in friendly emulation by two of the deepest thinkers of the XIXth century, 
Maxwell and Boltzmann. After Maxwell’s untimely death, Boltzmann had to carry 
on single handed a gallant fight against the rising tide of idealistic and positivistic 
criticism, which for a time swept even the solid bastions of physics. This reaction 
was the — hardly conscious — reflexion in the minds of physicists of the turmoil 
agitating the bourgeoisie during the period of transition to the imperialist phase of 
capitalistic society. In the field of physics, the effect of such a critical mood was, 
on the whole, far from detrimental: even an idealistic approach is not barren when . 
it is controlled by close contact with concrete reality. In fact, the work of the critical 
school was a necessary phase of the dialectical movement which liberated the mate- 
rialist conception of the physical world from its narrow mechanistic frame and carried 
it to the wider viewpoint of quantum theory. 

Boltzmann himself remained to the last on the ground of mechanistic materialism. 


‘He did not live to witness the unfolding of the new synthesis and enjoy the final 


triumph of his views. But in his controversy against the school of energetics” and 


‘the other opponents of atomism he was right in every point. With sure intuition he 


* Received September 93, 1954. 
| [3] 


emphasized those fundamental aspects of the atomic theory, which were bound to 
survive even the most drastic changes in our conception of the structure of matter. 
Admittedly, a robust faith was needed to uphold atomistic views in. front of the 
accumulating evidence which pointed to a severe limitation of the validity of the 
equipartition theorem. Boltzmann felt, however, that while the breakdown of equi- 
partition undoubtedly demonstrated our ignorance of deep-lying features of atomic 
constitution, there simply could not be anything wrong in the general physical ideas 
underlying the statistical treatment of atomic motions and interactions. The fruit- 
fulness of these ideas, their power of coordinating qualitatively different phenomena, 
was for him a warrant of their conformity with essential elements of reality. 

To appreciate the superiority of Boltzmann’s philosophical standpoint, it is 
instructive to contrast it with the timid and ambiguous attitude of Gibbs. While 
Boltzmann remained undaunted, Gibbs retreated before the difficulties. He en- 
deavoured to make use only of those parts cf the formal structure of statistical mechan- 
ics which were susceptible to a fully rigorous treatment with the methods then 
available. This means that he shunned the whole problem of the ergodic hypothesis, 
in spite of its fundamental importance, and that with regard to the physical inter- 
pretation he took refuge to the crudest platonistic conception of mechanical "anal- 
ogies" of the thermodynamical laws. If one takes this view, the real physical basis 
of the theory, and especially the essential character of uniqueness of the atomistic 
interpretation, is utterly lost. In fact, the paradoxical outcome of such an idealistic 
treatment is an atomic theory of heat from which the concept of atom is wellnigh 
banished. The exquisite formal elegance of Gibbs' treatise should not blind us to 
the fundamental inadequacy of the underlying philosophy. 

We now know that Gibbs’ mathematical scruples were beside the point. Lebes- 
gue's theory of the measure of sets and extension of the concept of integral have 
supplied mathematical tools perfectly adapted to the needs of statistical thermodyna- 
mics. With their help it has become possible to give a precise formulation of the 
conditions under which the ergodic theorems are valid; and these mathematical 
conditions correspond to an easily visualizable physical situation. The failure of the 
classical treatment is therefore not due to any formal inconsistency, but to a real 
departure of the atomic processes from the behaviour of classical mechanical systems. 

The extension of the methods of statistical thermodynamics to the radiation 
field by Rayleigh and Planck made it conspicuous that the difference between atomic 
and classical laws is a radical and deep-lying one. For a radiation field, the atomistic 
elements whose statistical behaviour is analyzed are the proper electromagnetic oscilla- 
tions of which it is composed. The breakdown of equipartition in such a simple and 
transparent situation brought about the crisis out of which the new synthetic con- 


ceptions of quantum theory eventually emerged. Quantum theory has removed all 


the uncertainties and contradictions which had hampered the completion of statistical 
thermodynamics on a classical basis. Moreover, by forcing upon our consciousness 
the dialectical character of physical laws, expressed by the idea of complementarity, 
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it has enabled us to get a clearer view, or at least a clearer formulation, of the comple- 
mentary relationship underlying the ergodic theorems. 

We are thus now in a position to survey completely the logical structure of 
statistical thermodynamics and to realize how faithfully this theory reflects the peculiar 
dialectical character of the physical situation. To bring this out is the aim of this 
essay, in which I shall try to outline very briefly the foundations of the atomistic 
interpretation of the laws of thermodynamics, laying the emphasis on the physical 
meaning of the formal arguments. The points that I want to make are no doubt fa- 
miliar to those physicists who have cared to look into the matter; but there is still 
widespread confusion concerning some of them, which may afford some justification 
for taking up the question from a synthetic point of view. In this connexion, it is not 
at all superfluous to discuss the classical approach as well as the quantal one: the 
true significance of the latter can only be perceived by contrasting it with the former. 


$1. The basic assumptions of atomic theory 


‘he most immediate assumption underlying any atomistic interpretation of 
phenomena of matter in bulk is that these phenomena are the outcome of processes 
involving immense numbers of atoms. This is of course the basis for the represen- 
tation of macroscopic physical quantities by statistical averages taken over correspond- 
ing quantities pertaining to the atomic system, the fluctuations of the latter quantities 
becoming negligible as the number of degrees of freedom of the system increases. 
Above all, it is a necessary requirement for understanding the emergence of macro- 
scopic irreversibility from the reversible phenomena on the atomic scale; for the 
statistical irreversibility expressed by the ergodic theorems has an essentially asymp- 
totic character. Even this elementary point is obscured by the formalistic approach 
of Gibbs, who lays a misplaced emphasis on the derivation of exact formulae valid 
for any number of degrees of freedom. This pointless elegance irritated Boltzmann: 
"I can understand statistics applied to a gas, he remarked!, but I see no point in 
applying it to a sewing machine." | t 

No less essential than the large number of the atoms entering into the consti- 
tution of ordinary bodies is the identity of their structure. The importance of the 
uniformity of atomic structure for a coherent account of large scale regularities was 
properly emphasized by Maxwell; he compared molecules with "manufactured ar- 
ticles”2. Incidentally, he carried the metaphor so far as to assert that the existence 
of these molecules, all "manufactured" on the same pattern, pointed to the existence 
of a Manufacturer: — a delightful example of the indirect, but very substantial impact 
on physical thought of the new social outlook dominated by the development of mass 


1 This utterance was reported in conversation Ъу Ehrenfest. It certainly has the characteristic 
Boltzmannian touch. 


2 Maxwell, 1873 [Sci. Papers, Vol. ЇЇ, p. 376]. See also Maxwell 1875 [Sci. Pap., Vol. II, p. 483] 
and 1871, Chapt. XXII in fih. Maxwell attributes the comparison to Sir John Herschell. 
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production methods; natural philosophy no longer culminates into the apotheosis 
of the craftsman — potter or clockmaker — but in that of the selfsatisfied factory 
owner. 

The full incorporation of the structural identity of the atomistic elements (par- 
ticles or radiation waves) into the description of systems of such elements has only 
been achieved within the framework of quantum theory, where it is embodied in 
the requirement that any state vector pertaining to a system of identical elements 
be either symmetrical or antisymmetrical in the sets of variables defining the elements. 
This condition, by which the degeneracy arising from the various permutations of 
the elements is removed from the start has also a direct physical meaning: it expresses 
a fundamental property of the elements, corresponding to their classification into 
"bosons" and ”fermions”, which has important repercussions upon their thermo- 
dynamical behaviour. 

In the classical treatment, no such new physical principle is recognized, of course; 
but from the logical point of view the situation is in no way different: here also one 
has very essentially to take account of the identity of the elements in a manner appro- 
priate to the formalism. I shall go into this point in some detail, since it is still currently 
misrepresented, in spite of Boltzmann's crystal-clear analysis. We have simply to 
consider that a physical state of a syscem of identical elements does not uniquely 
correspond to an ordered sequence of the sets of variables defining the states of the 
elements, but rather to the whole set of such sequences taken in any order. А phys- 
ical quantity, expressed in terms of the element variables, is accordingly invariant 
with respect to the permutations of these variables; in other words, it is a function, 
not of any particular sequence of variables, but actually of the set of all the sequences. 
In Boltzmann's (1877) terminology, an ordered sequence of element variables, i.e. 
a point in the phase space of the total system (more precisely, a small neighbourhood 
around such a point) is a "complexion"; for the set of complexions arising from 
each other by permutations of the element variables, and thus representing the same 
physical state of the system, Boltzmann himself uses the word "Zustandsverteilung", 


Ehrenfest (1911, p. 37) the more picturesque "Stern", which may perhaps best be . 


rendered as "constellation". 

Now, for the computation of statistical averages of physical quantities, it is 
obviously convenient to extend the required integrations over the whole phase space 
of the system, ie. over all complexions, whereas the physical average should be 
. taken over the constellations. To reduce the integral over phase space to a physical 
average, we must therefore divide it by the number of permutations of the elements. 
Thus, the mathematical representation of the system in phase space, which operates 
with complexions, is not immediately adapted to the physical situation, and the 
adjustment needed to restore the connexion with physically meaningful quantities 
is directly derived from the identity of the elements. 


Confusion on this point may easily arise from a rash assimilation of the element 


variables entering into the complexions to the elements themselves. It is clear, how- 
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ever, that the concept of complexion is a purely mathematical device, and that 
individuality of the element variables, far from corresponding to a similar property 
of the elements, is a spurious formal feature which has to be removed. The abstract 
character of the complexions is aptly expressed by Boltzmann’s (1877, p. 171 and 
p. 200 in Wiss. Abh.) graphic account of the statistical calculations: he works with 
"labels" (Zettel) on which sets of atomic variables are inscribed, and imagines these 
labels put into an urn from which they can be drawn at random to form complexions. 
He never indulges in the misleading — and perfectly superfluous — fiction of attach- 
ing the labels to the atoms, which would be tantamount to regarding the atoms them- 
selves as distinguishable from each other. On the contrary, the whole trend, of Boltz- 
mann’s argument can only be understood on the assumption that the atoms are 
essentially undistinguishable. | 

Gibbs’ (1902, Chapt. XV) discussion of "systems composed of molecules” in 
the last chapter of his treatise is in essential agreement with Boltzmann’s, but as 
usual he rather spoils the tale by undue reticence. His distinction between "specific" 
and generic” phases is identical with Boltzmann's distinction between complexions 
and constellations, and the relation of the concept of generic phase to the assumption 
of the identity of the molecules is clearly indicated (Gibbs 1902, p. 188). Unfortunate- 
ly, there is no hint of the fundamental significance of this assumption: the very 
fact that it is only mentioned, almost casually, at the end of the book shows how 
reluctant Gibbs was to commit himself to an atomistic philosophy. 

Before leaving the question of the structural identity of the atomic elements, 
it will not be superfluous to dispel a possible misunderstanding. In classical theory, 
the individuality of the elements is actually maintained, in spite of their structural 
identity, during the course of their interactions: in fact, the trajectory in phase space, 
which represents the temporal evolution of the system, establishes a definite connection 
between the successive complexions it takes up. Such a connection is, of course, 
entirely lost in quantum theory, in which the concept of trajectory is replaced by 
a mode of description of the interaction processes most consistently upholding the 
impossibility of individualizing the elements. This radical difference between the 
classical and quantal treatment with regard to the permanence of the individuality 
of the elements in no way conflicts with their essential similarity in respect of the 
statistical behaviour of these elements. The temporal aspect of the phenomena is 
in fact quite irrelevant for the statistical properties of the state of equilibrium; the 
whole purpose of the ergodic theorems is precisely to replace averages taken over 
the temporal succession of complexions by purely statistical averages defined by 
a stationary distribution in phase space. In Boltzmann’s discussion of the H-theorem, 
in particular, the statistical aspect is introduced by the "Stosszahlansatz", which 
does not express the true variation in time of the distribution, but only the asymptotic 
average of this variation. The averaging process, of course, removes any trace of 
temporal succession and, accordingly, of individuality of the elements. 

# 
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$2. The central problem of statistical thermodynamics 


The identification of macroscopic quantities with averages over their atomistic 
counterparts involves only a trivial use of statistical conceptions. But in the analysis 
of heat phenomena the most fundamental features of statistical theory are brought 
into play and the statistical aspect appears as a deep-lying element of the whole atom- 
istic conception of matter. Heat is, in fact, the macroscopic manifestation of a dis- 
ordered form of motion or propagation of the elements constituting the material 
bodies and radiation fields; by which is meant that the parameters describing this 
motion or propagation escape the control of the macroscopic observer, whose know- 
ledge of the state of the system is restricted to a few mechanical and thermal charac- 
teristics. The statistical approach to thermodynamics is thus immediately related 
to the fact that the very definition of the macroscopic states of the system entails 
a limitation of our control of the microscopic behaviour of its elementary consti- 
tuents. This is indeed the dialectical contradiction which arises in the development 
of a consistent atomistic picture, and which resolves itself into the synthesis of sta- 
tistical thermodynamics. 
The central problem is the statistical interpretation of the second law of thermo- 
dynamics, in its twofold aspect, concerning the behaviour of a system under controlled 
(quasistatic) and spontaneous transformations, respectively. In the first place, we 
have the statement of the existence of a quantity called entropy which belongs to 
each state of thermal equilibrium of the system, and whose variation when passing 
to some other equilibrium state in a controlled way is a well-defined function of 
the mechanical and thermal parameters characterizing the transformation. The 
statistical meaning of entropy is well illustrated by such simple processes as the mixing 
of two gases. For the increase of entropy in this case, when considered from the atom- 
istic point of view, can only be a function of the numbers of atoms involved in the 
mixing process: in fact, it is simply related to the greater probability of the disor- 
dered state represented by the mixture, as compared with that of the separated gases. 
Quite generally, entropy appears as a statistical measure of the degree of disorder” 
of the microscopic state of the system. Alternatively, to use a fashionable terminology, 
we may say that it is a measure of our "information" pertaining to the microscopic 
state; but we must not forget (a point which is not always properly appreciated in 
the so-called theory of information) that we are here referring to a very special type 
of information, viz. that which is obtained or lost by physical operations involving 
controlled changes of mechanical and thermal parameters, and that the entropy is 
not merely a mathematical concept, but itself a physical quantity, directly related 
to the quantity of heat exchanged between the system and other bodies during the 
process. _ 

= The other part of the second law, according to which the entropy of a thermally 
isolated system increases when the system undergoes a spontaneous transformation 
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from one state of equilibrium to another, expresses the essentially irreversible char- 
acter of physical processes on the macroscopic scale. From the atomistic point of 
view, this means that the thermal evolution of an isolated system, when its elements 
are left to follow the course determined by their own laws, tends statistically towards 
states of greater and greater probability. The derivation of this fundamental property 
is the object of the ergodic theorems, which form the backbone of statistical thermo- 
dynamics. Asymptotic irreversibility is indeed a very general feature of statistical 
"chains" of events: it is exhibited by results of lotteries just as well аз by assemblies 
of atoms. It is especially remarkable in the latter case, however, in view of the revers- 
ibility of the elementary laws of motion: the conciliation (in a statistical sense) of 
the dialectical opposites of macroscopic irreversibility and microscopic reversibility 
is the essential achievement of the statistical interpretation of the second law, strik- 
ingly illustrating the synthetic character of this interpretation. 

From the preceding outline it is quite clear that the introduction of the quantal 
description of the elementary constituents as a basic assumption instead of the classical 
picture does not make the least difference to the fundamental structure of statistical 
thermodynamics; for the quantal laws, just as the classical ones, are reversible with 
respect to time, and the problem of establishing the macroscopic irreversibility by 
taking account of the statistical element involved in the concept of macroscopic 
observation remains unchanged and is again solved by ergodic theorems. The issue 
has been obscured by the fact that quantum theory itself, in contrast to classical 
theory, introduces a statistical element at the microscopic level; and it has sometimes 
been confusedly argued that it is this elementary quantal statistics which provides 
the basis of macroscopic irreversibility. In reality, we have here two completely dis- 
tinct statistical features, which are not only logically independent of each other, 
but also without physical influence upon each other. The question whether the elemen- 
tary law of change is deterministic (as in classical physics) or statistical (as in quantum 
theory) is entirely irrelevant for the validity of the ergodic theorems. (This purely 
mathematical property of ergodic chains shows, incidentally, how groundless is the 
opinion that the application of probability considerations to physical phenomena 
should have to be "justified" by the existence of some "deterministic substra- 
tum"). А 

The two quite independent statistical features which pertain to the behaviour 
of the quantal elements on the one hand, and to the definition of the macroscopic 
states of a system of such elements on the other, both arise from a synthetic relation- 
ship of complementarity between mutually exclusive aspects of the phenomena. 
The complementarity between the macroscopic and microscopic modes of descrip- 
tion of an atomistic system presents indeed (cf. Bohr 1932) a striking formal analogy 
with the familiar pattern of reciprocal limitation in the determination of quantal 
variables: a complete knowledge of the microscopic state of the elements precludes 

- the use of such macroscopic concepts as temperature and entropy; conversely, the 
macroscopic description implies a widegoing renunciation of the knowledge of the 


microscopic state. But of course, this complementarity must be carefully distinguished 
from the entirely different dialectical situation of quantum theory. 

While it does not alter the main logical issue, quantum theory has, of course, 
an important physical signif icance for the establishment of statistical thermodynamics. 
In the first place, the physical consequences of quantal statistics, which contain 
those of the classical theory as limiting cases, are free from the inconsistencies of the 
latter. The equipartition of energy, which was the source of these inconsistencies, 
is reduced to a law of asymptotic validity; moreover, as already noted, the identity 
of the elements is more correctly taken into account, so as to cope with even such 
extreme cases of non-classical behaviour as that of the pure radiation field or the 
metallic state of matter. Above all, quantum theory makes possible, in a much more 
compendious way than classical theory, a unified treatment of both matter and radia- 
tion (and in fact of any physical system whatsoever). This is, of course, a consequence 
of the elimination of the dualism between matter and force and its replacement by 
the conception of the complementary manifestations of particle and field for every 
‚ constituent of the physical world. | 

A logical building up of statistical thermodynamics starts with the consideration 
of isolated systems and the establishment of the ergodic theorems which define the 
corresponding equilibrium distribution. The systems of actual physical interest are 
usually not isolated, but in interaction with their surroundings. However, the deri- 
vation of the equilibrium distribution under such conditions can be reduced to the 
ergodic case by conceiving the system under investigation as embedded in a much 
larger system, which may be treated as isolated to any desired approximation (just 
by making it large enough). The second step in the development of the statistical 
theory is therefore the study of the distribution of a small part of an isolated system. 
The final step consists in linking up the equilibrium distribution with thermodynamical 
quantities; this is achieved by setting up, in each case, a function of the statistical 
distribution which has all the properties of the corresponding characteristic function 
in termodynamics. For the discussion of the ergodic theorems it is necessary to give ` 
separate treatments for the classical and quantum theory, because the formulation 
has to be adapted to the radically different conceptions of the atomistic states of the 
system in the two theories. Once this is done, and a formal correspondence between 
classical and quantal expressions for statistical operators is established, the remaining 
steps of the argument can be presented in a unified manner. 


83. Classical statistics of isolated systems - 


Ergodic theorems can be enunciated for mechanical systems of the most general 
description, and it is advisable to begin by discussing them with this high degree 
of generality and abstraction, in order to bring out the part played by the. special 
characteristics of atomistic systems in the adaptation of the mathematical formalism 
to physical requirements. But before coming to the exposition of this formalism 

: ? 


The Foundations of Statistical Thermodynamics 11 


let us.try to- gain an orientation into the problem by approaching it from а semi- 
empirical point of view. 

Let us consider the temporal evolution of an isolated system with a very large 
number of degrees of freedom, from a certain instant at which all connexions with 
the surroundings have been severed. We know from general experience that after 
a period of adjustment, or "relaxation", it will tend to a stationary state in which it 
will persist indefinitely. From the atomistic point of view, the value of any macro- 
scopic quantity pertaining to the system will be interpreted as the average of a definite 
microscopic quantity over the interval of time necessary for the measurement. Such 
a time interval may be extremely short in the macroscopic scale and yet cover the 
passage of the system through a large number of successive microscopic configura- 
tions. In the present case, however, the time average will be the same over any interval, 
except perhaps during the period of relaxation; the macroscopic value pertaining 
to the stationary state of the system may thus be defined, without appreciable error, 
by an average extending over an infinite time. The introduction of this idealized con- 
ception of the time average is the essential step which opens the way to the applica- 
tion of statistical methods. For such an average, extending over all the microscopic 
configurations actually taken by the system, is obviously independent of the order 
in which they have been successively taken; it is therefore equivalent to a definite 
statistical average, i.e. an average defined by a distribution law indicating the relative 

. frequencies of occurrence of the various configurations of the system in the course 
of its evolution. Such a statistical distribution law, from which the time has disap- 
peared, completely characterizes the siationary state of the isolated system, and the 
fundamental problem of the statistical thermodynamics of isolated systems is the 
derivation of this distribution law. 

The above argument, however, is incomplete in as much as it takes for granted 
the empirically observed irreversible tendency of the isolated system to a stationary 
state. This behaviour, as already pointed out, is by no means obvious in view of the 
reversibility of the elementary law governing the change of configuration of the system 
with time. Our task is therefore not only to find the conditions under which the time 
average of any quantity exists independently of the initial state of the system, but 
also to define the precise meaning of the macroscopic irreversibility, i.e. to deter- 
mine in what sense we may say that any initial distribution of states tends towards 
the statistical distribution expressing the time average. This is the object of the er- 
.godic theorems. 

Using the convenient geometrical representation of the microscopica! configura- 
tions of the system by points of a symbolic "phase зрасе”, we may visualize its tem- 
poral evolution from some initial phase as a trajectory in this space. The macroscopic 
time evolution, however, is not truthfully represented by any such single trajectory, 
nor even by any bundle of trajectoriés whose initial phases have a known distribution: 
for the density of this distribution, by Liouville’s theorem, will remain the same as 
we follow it up along the trajectories, and will therefore not exhibit any tendency 
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towards any limiting distribution. Of course, this is just the irreversibility paradox 
in a new guise: we cannot hope to conjure up any irreversible trend from purely me- 
chanical considerations. 

The way out has been described with great clarity by P. and T. Ehrenfest (1911, 
p. 60—62) in their masterly critical survey of the problem: we must distinguish 
between a "fine" distribution of phases, which is defined on the microscopic scale 
and is transported without, alteration along the trajectories, and a "coarse" distri- 
bution, corresponding to a subdivision of phase space into finite cells (which may 
be of arbitrary size and shape). The physical meaning of such a distinction is clear: 
it expresses the fact that the macroscopic determination of the state of the system is 
represented, not by a single phase, but by an unknown distribution of phases, which 
we may in the most favourable case imagine reduced to a finite cell surrounding 
a given phase. The coarse distribution is thus the mathematical element by which 
the standpoint of the macroscopic observer is introduced into the analysis; and it is 
just this coarse distribution which reflects the macroscopic appearance of irreversi- 
bility. In fact, on this "coarse" scale, any initial distribution spreads out gradually 
through the whole region of phase traversed by the trajectories; in the same way as 
a little milk poured into a cup of coffee is diluted by stirring until a homogeneous 
beverage is obtained: yet all the time, the space occupied by the milk, although de- 
formed into thinner and thinner streamers, keeps the same volume. This simile so 
aptly illustrates the "mixing process" in phase space which explains macroscopic 
irreversibility that even Gibbs (1902, p. 145) did not forsake it in the obscure pages 
he grudgingly devoted to the problem in his book. 


We are now in a position to turn to the mathematical aspect of the theory, and 
to examine how it fits the physical situation. The mathematical relationships are best 
illuminated from the point of view of group theory?. We conceive the time evolution 
as an automorphism of phase space, the infinitesimal transformation of which is ex- 
pressed by the Hamiltonian equations of motion. Trajectories are thus regarded as the 
sets of all successive transforms P,, by this group, of some initial phase P,,, and 
it is the invariance of such sets with respect to the group that governs the global 
properties of the trajectories which are of interest for statistical considerations. Above 
all, it is essential that it should be possible to define a measure in phase space, inva- 
riant for the "Hamiltonian group" (as we shall call the automorphism in question). 
The existence of such an invariant measure is guaranteed by Liouville's theorem, 
and its simplest expression is obtained by using canonical coordinates Pi: q; and 
a euclidean metric in phase space. The volume element dw = IT;dp,dq; has indeed 
the required invariance, and it is easy to derive from it an invariant ar er measure 


do on the ”energy surface” which contains all the trajectories of an isolated system 
of given energy. 


Тыа Пы ыл e ee а =. 
* This was emphasized Ьу В.О. Koopman (1931). 
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No other property of the motion than the measure invariance expressed by 
Liouville’s theorem is needed for the proof of Birkhoff’s fundamental ergodic the- 
orem?^, which applies to any conservative mechanical system, provided only that 
all its trajectories are contained within a finite domain of phase space, — an assumption 
obviously fulfilled by all systems of physical interest. Birkhoff's theorem asserts 
the existence of the time average 

tot+T 


F (P) = тт [| ДР) dt a) 
Too 
to 

of any phase function f (P) for almost all trajectories, i.e. almost all initial phases Р,. 
The value F(P,) of the average is, of course, independent of the initial instant ¢,, 
but does in general depend on the trajectory (represented by one of its phases, P,). 
It is easy, however, to indicate a general condition necessary and sufficient to ensure 
the constancy of the time average almost everywhere on the energy surface: it is the 
metrical indecomposability of this surface, i.e. the impossibility of decomposing it 
into two invariant sets (with respect to the Hamiltonian group) both of positive meas- 
ure. Moreover, if this condition is satisfied, the constant value F of the time average. 
can readily be found by considering the integral of the time average over all initial 
phases, i.e. over the whole energy surface. For such an integration, taken in the sense 
of Lebesgue, the occurrence of exceptional trajectories, for which formula (1) breaks 
down, is immaterial, since they form a set of measure zero. This is just an example 
of how exquisitely the theory of measure meets the requirements of statistics. The 
existence of exceptional trajectories must of course remain a stumbling block for 
any attempt at a rigorous treatment of the ergodic problem unless full use is made 
of the resources of Lebesgue's algorism. The resulting expression for F is 


– 5, ДР) ас, (2) 
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where 2 = f dø is the invariant measure of the energy surface. 
S 


Although the argument leading to formula (2) is based on the consideration of 
a definite trajectory, it holds just as well if we start from any coarse distribution 
of initial phases. The quantity F given by formula (2) thus represents the statistical 
average equivalent to the time average, i.e. the macroscopic quantity corresponding 
to the phase function f(P) in the state of equilibrium. The equilibrium distribution, 
called ergodic” (or in Gibbs’ terminology "microcanonical"), may simply be des- 
cribed as uniform over the energy surface with respect to the invariant measure 
on this surface. We may still be somewhat more realistic, and introduce also some 
uncertainty in the determination of the energy of the system, which amounts to 
Teo ave ta ce Sans 


за G., D. Birkhoff (1931). The first deg to the modern ergodic theory was given by J. v. Neu- 
mann; see the historical survey by B. О. Koopman & С. D. Birkhoff (1932). 
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considering an “energy shell” of finite thickness instead of an energy surface. We 
must then assume metrical indecomposability for almost all the energy surfaces 


contained in the shell, and replace (2) by a formula of the same type, 


1 
Е = E лр) do, (3) 
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in which the integration now extends over the energy shell 5 whose volume is denoted 


by X; in other words, we get a coarse ergodic distribution uniform over the energy 


-— + 


shell. 

So far Birkhoff's theorem. It establishes the existence — and the form — of 
the statistical equilibrium distribution, but it sheds no light on the "mixing process" 
by which it is reached asymptotically. The condition of metrical indecomposability 
of the energy surface tells us that the bundle of trajectories issuing from any initial 
cell will eventually "fill up" the whole energy surface or shell, but it is not sufficient 
to ensure that the initial coarse distribution will itself in the course of time fill the 
surface or shell uniformly. А rather trivial example, which, however, sufficiently 
illustrates the issue, is that of a rigid rotation around a fixed axis: any distribution 
of phases will in this case be bodily displaced within the energy shell with the velocity 
of the rotation. Such a behaviour is by no means of mere academic interest. To under- 
stand its physical significance, it will be necessary to look a little more closely at our 
mixing process in phase space. Like every mechanical process, it is not, of course, 
truly irreversible; on the contrary, it exhibits the peculiar quasi-periodicity, arising 
from its invariance for time reversal, which is expressed by Poincaré's theorem. 
In the long run, any deviation from uniformity in the distribution, however large, 
is bound to recur again and again; the average periods of recurrence, however, in- 
crease so rapidly with the number of degrees of freedom of the system аз to exclude 
the possibility of ever observing the recurrence of large fluctuations on the macro- 
scopie scale. Now, this is just the point with respect to which our rigid rotator is 
anomalous. For this system, fluctuations of arbitrary amplitude around the ergodic 
average of any phase function occur with the frequency of the motion itself; the 
ergodic average will then not have the clear-cut physical interpretation we wish to 
ascribe to it. 

` The mathematical analysis of the mixing process has been carried out by E. Hopf*, 
and one of his results, which admirably answers the physical requirements, fully 
deserves the dignity of second ergodic theorem. Roughly speaking, it supplements 
Birkhoff’s theorem by giving the additional condition under which fluctuations 
average out in the course of time, thus guaranteeing the adequaey of the ergodic 
average as representative of macroscopic quantities. We are here concerned, of course, 
not with the instantaneous values f(P,) of phase functions, but with the "coarse 


` 4 Hopf 1932. See also Hopf 1934, especially p. 92, and Hopf 1937, p. 37. 
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estimates" of such functions, obtained by smoothing them out over some arbitrary 
phase distribution, e.g. a finite cell around some phase, specified by a density function 
D(P). At time t, the coarse estimate of f(P) will be given by the integral: 
ff (P) D(P)) do, where the transform P, of P has to be expressed as a function of P 
and £. The mixing process may then be described by saying that while the estimate 
just defined may in the course of time repeatedly deviate by any amount from its 
ergodic average F, the time average of these fluctuations (taken absolutely, e.g. quad- 
ratically) will tend to zero as it is extended over longer and longer times: 


T 


T gt - 
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Hopf’s ergodic theorem states the necessary and sufficient condition for the 
validity of formula (4). In order to formulate this condition, let us consider the phase 
space representing pairs of phases (Р, Р”) and the automorphism defined by the 
direct product of the Hamiltonian group transformations applied to P and P’, re- 
spectively. А trajectory in this product space will thus represent a pair of trajectories 
of the original system, contained in the two energy surfaces defined by the phases 
P, P'. By taking all possible pairs of such energy surfaces, within the energy shell 
defined by the distribution D(P), we define an energy shell in the product space 
which contains all the trajectories in this space corresponding to the product distri- 
bution D(P) D(P’). Now, it is a simple matter to transform the time average occurring 
in formula (4) into a time average in the product spacé, which reduces the proof of (4) 
to an application of Birkhoff's theorem to the whole energy shell in the product space. 
The condition for the validity of (4) is therefore the metrical indecomposability, with 
respect to the product Hamiltonian group, of almost all energy surfaces of the shell 
in the product space. Thus, while the existence of the ergodic average depends on 
a property of single trajectories, the mixing process involves a further condition 
applying to pairs of trajectories. The meaning of this somewhat abstract condition 
may be elucidated by our example of the rotating motion around an axis. If the an- 
gular velocity is not a constant, but a definite function c (J) of the angular momentum, 
the mixing process will in general occur, the initial distribution: becoming more 
and more distorted and diluted; it is only required to exclude any rational propor- 
tionality between the angular velocities (except possibly for а set of pairs œ (J) : œ (J") 
of measure zero), and this is precisely equivalent to Hopf's condition. 

The last, but not the least interesting, question to examine is that of the fulfil- 
ment of the condition of metrical indecomposability for actual physical systems. 
'This condition obviously plays the part of the original "ergodic hypothesis" of 
Maxwell and Boltzmann, the inadequacy of which had been proved, as soon as the 
methods of the theory of sets became available, by Rosenthal (1913) and Plancherel 
(1913). The great advantage of the exact formulation of this hypothesis is to make 
it amenable to more detailed analysis. Such an analysis is also very much needed, 
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since, as we shall see, the notion of metrical indecomposability requires an important 
modification to adapt it to the cases of physical interest’. 

Let us first mention a rather trivial correction. Strictly speaking, the energy 
surface can never be metrically indecomposable, since (as is easily seen) any time- 
independent integral of the motion, distinct from the energy integral, can be used 
to effect a decomposition of the surface into two invariant sets of positive measure, 
in which the integral in question is, respectively, larger and not larger than some 
suitably chosen value. Thus, the manifold containing a given trajectory must be 
defined by assigning a definite value, not only to the energy integral, but also to 
those other time-independent integrals (such as momentum and angular momen- 
tum) which can be fixed by external conditions and are thus, in principle, under 
the control of the macroscopic observer. However, for most practical purposes, it 
is permissible to assume that those other "controllable" integrals have a constant 
value over the greatest part of the energy surface (e.g. that the system is connected 
with some auxiliary body so heavy that it remains practically at rest), so that the 
identification of the reduced manifold with the entire energy surface only involves 
a negligible error. 

In general only a few of the distinct time-independent integrals, which together 
determine the shape of the trajectory, are "controllable"; this is intimately connected 
with the circumstance that the same physical state may correspond to a whole set 
of phases, and that phase functions representing physical quantities must accordingly 
be uniform, in the sense that they must have the same value for all equivalent phases: 
only uniform integrals can be regarded as controllable. We have already discussed 
the case of systems of identical elements, with the distinction of specific and generic 
phases it entails; another example is afforded by angular coordinates, which are 
only determined physically modulo 2л, so that only phase functions which are pe- 
riodic in these coordinates can have a physical meaning. The system formed by 
two uncoupled rigid rotators with different fixed axes illustrates in a simple way 
the occurrence of both uniform and non-uniform integrals. 

Now, a non-uniform integral will also give rise to a decomposition of the energy 
surface (or of the reduced manifold defined by the uniform integrals) into invariant 
parts of positive measure. But if this decomposition is such that no two phases of 
any part are physically equivalent, it will not impair the validity of the ergodic theorems 
provided that their application be restricted to uniform phase functions. In the example 
of the two rotators, we clearly get a complete picture of the physical situation by 
restricting the range of the angular variables to the interval (0,27) and imagining 
the successive segments of the trajectory referred, by appropriate translation, to the 
part of the energy surface corresponding to these intervals. In order, therefore, to 
cope with the very general occurrence of equivalent phases, we must substantially 


м Е 
5 For all the following discussion see Khinchin (1949), Chapt. Ш. 
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modify the condition of metrical indecomposability and the scope of the ergodic 
theorems. We shall call a decomposition of the energy surface into two invariant 
parts of positive measure essential when all physically equivalent phases belong to 
the same part. The surface will then be metrically indecomposable in the physical 
sense when it does not allow of any essential decomposition. It is this property which 
guarantees the applicability of the ergodic theorems to uniform phase functions. 

Considering the most important case of atomistic systems composed of identical 
elements, the modification just indicated of the ergodic theory is exactly what is 
needed to establish the relation between mathematical expressions involving com- 
plexions and their interpretation in terms of constellations. How essential this reduc- 
tion is for the consistency of the whole atomistic picture will appear, in particular, 
if we ask for the definition of the relaxation period necessary for the macroscopic 
equilibrium state to establish itself. To visualize this process, the concept of con- 
stellation is clearly indispensable: for otherwise we should be led to the absurd con- 
clusion that it would require several of the recurrence periods defined by Poincaré’s 
theorem. We must, however, imagine the trajectory issuing from the domain of phase 
space which represents the initial constellation and traversing successively the domains 
of the most varied constellations, which form a chaotic entanglement; the constella- 
tions corresponding to the equilibrium state, occupy such an overwhelming 
part of the energy shell that, however improbable the initial constellation, the trajec- 
tory is bound to enter very soon some “prong” of an equilibrium constellation; in 
fact, if the number of elements and the energy are taken to correspond to usual 
physical conditions, this will happen in atime of the order of magnitude of the ob- 
served relaxation periods. 


$4. Quantal statistics of isolated systems 


We shall now review the ergodic problem on the basis of quantum theory and 
try to bring out the complete parallelism with the classical case. From the formal 
point of view, the correspondence between the classical and quantal concepts and 
Jaws which come into play is exhibited by the following comparison: 


A classical state of the system is re- A quantal stale of the system is 
presented by a phase P. A distribu- 
tion over phases is defined by a density 
function D(P), and the average of 
any phase function f(P) over this di- 


stribution by the integral ` 
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represented by a state-vector y. Ап 
arbitrary statistical distribution is de- 
fined by a density operatorë D, and 
the expectation value of any operator f 
for this distribution by the trace 


tr (fD). 


6 This is v. Neumann’s projection operator U, normalized to unit trace, i.e, D = [trU]~1U. Cf. 
v. Neumann (1932), p. 41. ў 
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The time evolution of the state P The time evolution of the state v 
is represented by a trajectory defined | is represented by a "rotation" of the 
by the Hamiltonian group transforma- | state-vector defined by the transfor- 


tion mation i 
Р-Р. cA T. | 
Any distribution density D(P) obeys | where H is the Hamiltonian operator. 
the differential equation, expressing The density operator D obeys the 
Liouville's theorem, differential equation, equivalent with 


Schródinger's equation, 


р) 0 
г ллы (0) 
where (H,D) denotes the Poisson [ а i 


bracket. | where [H, D] denotes the commutator. 


For the discussion of the conditions of validity of the ergodic theorems, it will 
be necessary to assume (just as in the classical case) that the system considered is 
enclosed within a finite volume; as a consequence of this hardly restrictive assump- 
tion, the energy spectrum E; is discrete and defines a complete orthogonal system 
of eigenvectors p; with associated density operators Dg;. The analogue of Birkhoff's 
theorem is here an immediate consequence of the fundamental equation (6). According 
to this equation, the time dependence of any density operator D, expressed by 

i i 


"en DONNE 


у v (E, — Е») ba 
= e 


is almost periodic (in the sense of H. Bohr), and the expectation value tr[D(¢)f] of 
any operator f has accordingly a time average. In general the time average will depend 
on the detailed structure of the initial distribution D(0); but if (and only if) the energy 


spectrum is non-degenerate, the time average (DD of the distribution takes the simple 
form 


«D» = M D,D(0)D,, 
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In this formula, the initial distribution occurs only through the coeffici 

tr [Dş;D (0)], which represent the statistical weights with which the ы д Seay 
states enter into the composition of the initial distribution. Thus, the oper. t oS 
defines a statistical distribution formed by a "mixture" (inv. Мере э Ur 
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of energy states with statistical weights which can be chosen arbitrarily: this is indeed 
the closest possible analogue to the ergodic distribution over an energy shell, and 
the condition of non-degeneracy of the energy is the pendant of the metrical inde- 
composability of the energy surface. 

The element of uniformity introduced by the time average is well brought out 
by considering the special case of the distribution Dy corresponding to some state- 
vector y, which we can expand in the form y = суру. While such a distribution 

J 


expresses in its most elementary form (as a "pure case" according to у. Neumann’s 
nomenclature) the statistical feature introduced in quantum theory on the micro- 
scopic scale, its time average has a more macroscopic character, inasmuch as it only 
depends on the probabilities л; = | c; |? and no longer оп the complex amplitudes 
с; = л? exp (та,) themselves. In fact, the whole time dependence of the distribu- 
tion Dy is contained in the arguments «,, and the time average is identical with an 
average over these arguments, assuming that they are distributed at random. It is 
this uniform distribution of the arguments «; which corresponds to the uniform phase 
distribution in the classical energy shell. 

The discussion of the "mixing process" leading to the ergodic distribution <D) 
has been carried out by v. Neumann (1929). The first problem is how to construct 
the analogue of a "coarse" subdivision of phase space. To this end, we may try to 
replace non-commuting canonical variables р, 4 by a pair of commutable operators 
P, Q which approximate them with reciprocal latitudes not exceeding considerably 
the limit indicated by the indeterminacy relation; it can be shown that this is actually 
possible. We obtain in this way a set of commuting variables, by means of which we 


‘redefine all operators of physical interest, including the Hamiltonian: the new opera- 


tors may be regarded as "coarse estimates" of the actual ones. Some suitably selected 
set of "coarse" operators can now be used to fix a complete orthogonal system of 
state-vectors wp, such that to each distinct set of eigenvalues of our operators there 
corresponds a finite subset of state-vectors, defining a "cell" characterized by a density 
operator d = s 1Z Dy, where the sum extends over the s state-vectors of the cell. 
To a definite eigenvalue of the "coarse" Hamiltonian there will correspond a "shell? 
with the density operator Dg = S-1X Dy;, comprising a subset of 5 eigenvectors g;. 
Since the gy, and cy are linear combinations of each other, we may also write Dg = 
= 8-17), and thus consider the shell аз an assembly of a certain number N of 
cells: | 


Any initial distribution D within the energy shell has the time average Dy, provided 
that (as we shall henceforth assume) the energy is non-degenerate. 

We may now formulate von Neumann’s theorem, which indicates under which 
conditions the time average of the quadratic fluctuation 
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lim — | Ctr [D(F] — tr(DzF))?dt, (8) 


for any "coarse" operator F, can be said to vanish. There is, as in the classical theory, 
an additional necessary and sufficient condition on the Hamiltonian, viz. that there 
be no "energy resonances", ie. no two energy differences E; — E, equal to each 
other; the parallelism with Hopf's theorem need not be stressed. The vanishing of 
the time average (8), however, can only be established in the sense of probability ; 
i.e. one shows that the probability for it not to vanish can be made arbitrarily small 
by suitably fixing the coarse subdivision of the energy shell. 

A brief outline of the proof of von Neumann's theorem will make this last point 
clearer. In the first place, an upper bound for the expression (8) can be derived by 
a straightforward calculation, in which use is made of the condition of no resonances 
to eliminate the structure of the initial distribution D; one gets 


tr(D F?) AN м Mj + У лу, My} 
7 gr 
with 
N Е N 
M,= S 201500,4) —1, M, = № s, Str(D, d, Dp, d,). 


y= 1 pl 


In this expression, the operator F only enters through the ergodic average of its 
square, and the initial distribution D through the probabilities л; of the energy states. 
The coefficients M;, М;,, on the other hand, depend on the choice of the cell sub- 
division symbolized by the d,. All we can do is to study the distribution of the values 
of the Му, Mj, as the set of eigenvectors w, is rotated with respect to the ф;, and 
to estimate the probability that the upper bound of these coefficients be larger than 
some assigned value &. This is a delicate and tedious investigation’, which finally 
yields for the probability just mentioned an expression of the form 


const. exp {— bV S/N + 2 log 5), 


where b is a number of the order of unity; this estimate is valid for & > {2N/(S — 2)}. 
To make this probability vanishingly small, we have only to take 


SIN > (2 log S)2, 


ie. to ensure that each cell of our coarse subdivision contains a large number of 
eigenstates of the Hamiltonian: a quite reasonable requirement. 

Summing up the results reached so far, we conclude that in spite of the deep- 
lying disparities in the fundamental atomistic conceptions, classical and quantum 
Ре M ee eee 

7 This part of у. Neumann's analysis has been simplified and improved by Pauli and Fierz (1937) 


об 
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theory lead to essentially the same description of the basic statistical features which 
account for the macroscopic behaviour of isolated systems. The ergodic theorems 
are hardly of any direct practical use, but, as we shall proceed to show, they provide 
the natural starting point for the derivation of statistical laws adapted to the treatment 
of all the thermodynamical processes. 


85. Statistics of non-isolated systems 


The interactions of a physical system composed of a large number of elementary 
constituents with its surroundings are of two kinds: there are dynamical interactions, 
involving exchange of energy, momentum and angular momentum; there is also 
a flow of the atomic constituents themselves into and out of the system, with an 
accompanying exchange of mass and perhaps electric charge or other substantial 
properties. Strictly speaking, the two types of exchange cannot be separated; in 
particular, according to quantum theory, no physical system in interaction with 
others can be regarded as having a fixed number of elementary constituents, since, 
with an unlimited supply of energy from the surroundings, we cannot exclude the 
occurrence of creation and annihilation processes. However, it is of course a quite 
useful approximation in many cases to neglect such processes, and it is convenient 
to distinguish closed and open systems according as the number of elements is fixed 
or variable, and to treat separately these two classes of non-isolated systems. More- 
over, it will be sufficient to restrict the analysis of dynamical interactions to the ex- 
change of energy, which is the only one of interest for the theory of heat, and which 
is anyhow typical of the kind of argument to be applied to other dynamical processes 
if necessary. 

It is in fact on the process of energy exchange that the concept of temperature 
is based. If we want to ascribe any system a definite temperature, we must bring it 
into thermal contact with a "thermostat", i.e. allow it to exchange energy with a system 
whose heat capacity is so large that it maintains its temperature constant to any desired 
approximation. The concepts of energy and temperature thus refer to mutually ex- 
clusive situations, which afford an especially clear example of the general comple- 
mentary relationship between the dynamical and thermodynamical aspects of the 
phenomena. The synthetic character of the atomistic conception reveals itself strik- 
ingly in the existence of a quantitative relation between the temperature and the 
average energy of the system: we here see again how the introduction of a statistical 
element provides the necessary link between the complementary aspects. 

The statistical distribution characterizing a closed system of given temperature 
reflects this dialectical situation by an essential difference from the ergodic distri- 
bution pertaining to systems of given energy: it is not limited to an energy shell, 


` but extends over the whole domain of possible states of the system and, in particular, 


assigns definite statistical weights to the different energy shells. To derive this law, 
the system we must cofisider is formed by the association of the body which we de- 


€ 
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бо 


scribe аз a closed system® and the thermostat with which it exchanges energy. The 
latter must be supposed to be so large that the total system may always be regarded 
as isolated, and accordingly subjected to an ergodic distribution law. Our aim must 
be, then, to separate from this distribution of the total system that of the body by 
eliminating the variables which describe the state of the thermostat, so that in the 
final result, the influence of the thermostat is only represented by one parameter, 
related to the temperature. This can be achieved by suitably idealizing the condi- 
tions of the problem. 

Although it is quite essential for the whole argument that there should be an 
interaction between the body and the thermostat, it is immaterial how large this in- 
teraction is. It is therefore permissible to assume it to be infinitesimal in comparison 
with the energies of the body and the thermostat considered separately. This allows 
us to treat the total system formally as a mere juxtaposition of body and thermostat, 
the total Hamiltonian being just the sum of the Hamiltonians of the two components. 
We may go a step further, and avail ourselves of the arbitrariness in the specification 
of the thermostat to introduce more symmetry into the question: we assume that 
the thermostat itself is a juxtaposition of a very large number of components (each 
of them of quite arbitrary structure), which are only bound together by negligibly 
small interactions; the body under investigation then appears as just one of the many 
units composing the total system. Our problem is thus reduced to a form of great 
simplicity and generality: if an isolated system is. made up of a very large number 
of components with negligible interactions, to find the statistical distribution of 
any one of the components. 'The formulation and solution of this fundamental prob- 
lem is one of Boltzmann's (1871) great achievements; he had especially in view the 
derivation of the distribution law for an ideal gas of molecules of complex structure?, 
but he was well aware of the universal scope of the argument. 

It is instructive to go a little more closely into the technical details of the analysis. 
Let us start with the system composed of the body and the thermostat. A state of 
this system will be represented 


in classical theory by a phase in quantum theory by a state vector 


PAZ (2, ВЫ V. = V X Ve 


of the "product space" obtained from | which is the direct product of two 
the phases P, Pg of body and thermo- | state-vectors y, фу pertaining to body 
stat. and thermostat, respectively. 


Е Ic Hu eZ EME em TTE. 
8 It may be a material body or a radiation field within a finite enclosure. 
° Tn this case, the components are identified with the individual molecules, while the gas itself 
plays the part of the thermostat. It would seem at first sight that we here run into the inconsistency 


of assigning to the gas both a definite temperature and a definite energy; but the latter must, strictly 
speaking, be thought of as infinite, 
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The Hamiltonians H, Н, and 


H, = H + H; 


define in the respective phase spaces 
sets of energy surfaces S(E), 5, (Е) 
and 5, (Е) with elements of invariant 
measure 


do (E), до (Е), do (E). 


It is easy to see that the element of 
measure do,(E) which serves to com- 
pute the ergodic average on the sur- 
face Е of the total system can be ex- 
pressed as 


do,(E) = | dede (s) dog(E — е); 


that 
associate in all possible manners pairs 


this formula means one must 
of energy surfaces in the body and 
thermostat spaces in such a way that 
their energies have the sum Ё, inte- 
grate first over each pair and then 
over all values of the energy = charac- 


terizing each pair. 


define sets of eigenstates Ф, 9, 9, 
with corresponding "decompositions of 


unity" 19 


dU,(E), dU(E), dU,(E). 


It is easy to see that the operator 
dU,(E) which represents the density 
operator of the ergodic distribution in 
the energy shell (E, Е + dE) can be 


expressed as 
dU(E) — /40(=) x 40Е — е), 


the integral being interpreted in the 
sense of Stieltjes, as explained by v. 
extended over the 


Neumann, and 


whole range of values of the energy e. 


In this way, any “coarseness” in the distribution of the total system is entirely 
attributed to the thermostat, while the body has a "fine" distribution in energy. 
The ergodic average F of any quantity pertaining to the body, and accordingly 


represented by 


a function f(P) in the body space, | an operator f acting upon the y's, 


may thus be written in the form 


poe fa Fe) u(e) m Md ; 9) 


‚ (Е) 


in this formula, F(e) represents the ergodic average for the value ¢ of the energy 


of the body, a quantity given by 
F()E() = [ УР) а 


S(e) 


F(e)tr[dU(e)] = їт[/4Ш(=)] 


10 See у. Neumann 1932, р. 59 sqq, In our case of a discrete energy spectrum, we have (ibid. p. 65) 
dU(E) = Dy. , the sum extending over all the eigenstates in the interval (E, E + dE), 
i à 
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and completely independent of the thermostat; the factors u(E), ug(E), u (E) are 
defined by expressions of the type 

u(E) = Z(E) | u(E)dE = tr|dU(E)]. 
One has 


u,(E) = f de u(e)u,(E — г), (10) 


and it is only the ratio ив (Е — e) / u,(E) which represents the influence of the thermo- 
stat on the statistical weight of the energy value e. 

Boltzmann’s original argument proceeds from this point to a direct estimate of 
the weight factor ug(E) for a very large system of weakly interacting components. 
If the number of components is N, the expression ug(E) depends on the energy by 


3 
the factor E^. Writing E — $ №, we therefore get, asymptotically, 


3 
ив(Е — =) ~ Ез“ (1 — = 
Since further, by (10). 
EN 
u (Eje ЕЗ Ze f de ule) exp (— 2/0), 
we finally get the familiar result 


us (E — =) 


и, (E) = 2-1 exp (— =/0). 


This part of the argument, however, has recently been refined in а most interesting 
way by Khinchin (1949, Chapt. IV and V), whose beautiful little treatise presents 
a penetrating analysis of the mathematical structure of the classical theory. Besides 
being more rigorous than Boltzmann’s, and thereby providing the basis for a direct 
treatment of fluctuation problems, Khinchin’s derivation is remarkable because it 
illuminates the intimate connexion between Boltzmann’s distribution law and the 
very foundation of the calculus of probabilities. 

Khinchin’s argument is based on the remark that if we consider the total system 
of body and thermostat as composed of N +1 non-interacting systems, we have 


between the corresponding quantities uj(E) (i = 0, 1, 2, ..., N) and that, u,(E), be- 
longing to the total system a relation | 


5 
МЕ) = f дец) | anota fom fane e, an 


is 
precisely of the same form as the "law of composition" expressing the probability 


N 
density u(E) of the sum Е = Ze; of (N + 1) stochastic variables with probability 
i=0 


1 
densities и;(=;). The u's cannot be immediately identified with such probability 
densities becáuse they cannot be normalized, but this is readily remedied by multi- 
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plying them with exponential factors of the form exp (—fe;), where В is an arbitrary 
positive parameter; i.e. the quantities 


ne) = (ZP) ule) exp (Be), (12) 
with 
Z(B) = [ de X9) exp (—Be) 2,08) = f tr [dU;(e)] exp (—Be) 
= Í dw exp (—ВН,). = tr [exp (—8Hj)], (13) 


are properly normalized probability densities satisfying a composition law of the 
type (11). Now, to these densities, formally "associated" to our systems by the rela- 
tions (12), we may apply the central theorem of probability theory, which gives an 
asymptotic form for large N of the probability density z (E), valid for arbitrary den- 
sities л;(є;). Using this form both for the density @,(E) associated to the total system 
and that, ль(Е — г), associated to the thermostat, and returning to the u's by means 
of (12), we get (with neglect of unimportant fluctuation factors) 


ив (Е — € 
MW ) — {2 (p) exp (— Bà), (14) 


with the normalization factor Z(ß) defined by (13) in terms, of course, of the Hamil- 
tonian H of the body. All that remains to be done to establish (14) as (in Gibbs’ 
terminology) the "canonical" distribution law is to relate the formally introduced 
parameter В to the temperature; but we need not dwell upon this point. 

Let us now turn to the open systems. "This is the new field opened up by Gibbs, 
and in which his powerful methods reign supreme. Not only did he develop single- 
handed the whole thermodynamical treatment of these systems, dominated by the 
concept of chemical potential, but he also gave the key to their statistical analysis 
by his definition of "grands ensembles" obeying a distribution law with respect to 
the numbers of constituent elements; built up on the same pattern as the canonical 
distribution in energy. Faithful to his platonistic attitude, however, he did not attempt 
to link up the new conceptions directly with the physical situation they were intended 
to describe, but relied for their justification upon the wide range of their successful 
application. Yet such a direct link is by no means difficult to establish. From the 
thermodynamical point of view, Brensted (1933) showed by a simple and elegant 
argument, involving only the usual manipulation of idealized pistons and semiperme- 
able membranes, that the concentrations of the various constituents of a chemical 
system can be changed in a quasi-static way without modifying either the entropy 
or the other thermodynamical variables; thus justifying Gibbs’ procedure of treating 
these concentrations as independent variables in his definition of the chemical poten- 
tials. As regards the statistical theory of the "grands ensembles", it is clear that it 
should find its foundation in a relationship similar to the body-thermostat inter- 
action, but involving in this case an exchange of matter between the body under 


, 
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consideration and some very large reservoirs of the various species of elementary 
constituents whose numbers are allowed to vary. The way/in which such an argument 
can be developed has been sketched by Tolman (1940), who, however, confined 
himself to classical theory and, moreover, left out any discussion of the points raised 
by the identity of the elementary constituents. Tt will therefore not be superfluous 
to re-examine the problem. 

To avoid unessential complications, we shall just consider a homogeneous body 
exchanging elementary constituents (which we shall call "molecules") with a large 
reservoir of the same kind. The total system of body and reservoir will be assumed 
to contain a fixed number of molecules, and we shall moreover take it to be in thermal 
equilibrium at a given temperature. Аз to the mechanism by which molecules are 
exchanged between the body and the reservoir, we may idealize it arbitrarily without 
altering the ultimate equilibrium distribution. The macroscopic state of the reservoir 
will depend on the mechanism of molecule exchange, but this is immaterial for the 
argument; it is only necessary to realize that, for a given exchange mechanism, the 
state of the reservoir is fixed by that of the body. We shall imagine that molecules 
are only allowed to pass so slowly between body and reservoir that there is no appre- 
ciable interaction energy between these two components of the system, and that the 
variation of the number of molecules in the body appeárs as a succession of indepen- 
dent chance events. Since the body is very small compared with the reservoir, the 
a priori probability of a given partition N + N’ of the total number N, of molecules 
between the body and the reservoir is accordingly expressed by a Poisson distribution 
law 


p(N) = (15) 


around some average value N, which is part of the external conditions. The use of 
the Poisson distribution in this case is entirely similar to the application of the central 
theorem to the partition of energy betweeen body and thermostat. 

Now, for any given partition, characterized by the number N.of molecules in 
the body, the macroscopic value of an atomistic quantity pertaining to the body is 
(owing to the additivity of the Hamiltonians of body and reservoir) the usual cano- 
nical average Ру, which of course depends on the number N of molecules. For the 
"grand ensemble" formed by all possible partitions, however, we must take a weigh- 
ted average of the Fy's taking account of the presence of the reservoir; our task is 
to reduce the weight factors to a simple form in which the reservoir is represented 
by only one parameter, related to the chemical potential. To explain how this pro- 
gramme is carried out, we must specify a little more closely the formal treatment of 
atomistic quantities relating to systems with a variable number of molecules; such 
quantities are of the general type 


N : N N 
ЛР) S yy pt y palco aptus (16) 


jx i=1 j=1 
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where, in classical theory, the phase 
P,, is formed of the individual phases 
Pi 1,2,... №) of the molecules. 


If H(Py) denotes the body Hamiltonian, 


Fy = GN)! | ао РА) „-ВЫ(Ры), 
with 
Z(N) = [den o PH (PN) 


The integrations are taken over specific 
phases; but the 
phases, effected by a division of each 
integral by N!, does not affect the 
average Fy itself. 


passage to generic 


where, in quantum theory, the ”con- 
figuration space" Py if formed of sets 
of variables P,(i = 1, 2, ... N) defin- 
ing the individual states of the mole- 
cules. 


the canonical average Fy may be written 


(А нү уН 


ына). 


the trace being taken with respect to 
a subset of symmetrized or antisymme- 
trized eigenvectors corresponding to the 
given number N of molecules. 


When we take the canonical average for the total system, the reservoir contributes 
a weight factor Z,(N,— N), defined in the same way as Z(N) in terms of the reser- 
voir variables, and referring to the number N,— N of molecules in the reservoir. 
The statistical average over the "grand ensemble" thus takes the form 


САМ) 
F = M в (№) Ем (М) | 
> 


LATET ай 


with 


Z, (N) = У w(N)Z(N)Z, (N,— N); (18) 
N . 

the structure of these formulae is fully analogous to that of the formulae (9) and (10) 
referring to the energy partition over the body-thermostat system. 

The a priori weight factor w (N) has, | In the quantal formula, we may take 
in the classical case, just the Poisson | w (N) = 1, since the а priori weight- 
expression p(N) given by (16). ing 13 included in the restriction to 
symmetrized or antisymmetrized eigen- 
of the 


vectors in the computation 


traces. 


The elimination of the reservoir variables rests upon the remark that the quan- 
tity log Z,(N'), taken for the equilibrium value № of the molecule number in the 
reservoir, is directly related to an extensive thermodynamical function and is there- 
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fore (in our simple case of an homogeneous system) proportional to N’. For suffi- 
ciently small deviations from the equilibrium state, it will therefore be permissible 
to write 


ZU) (м (м 19) 
Ж? (№— №) z = (М —N) 2, (19) 


where the reservoir function z can be ultimately expressed in terms of the thermo- 
dynamical variables, including N, which specify the macroscopic state of the body; 
large fluctuations of N, on the other hand, are so seldom that there is no appreciable 
error in applying formula (19) to the whole range of values of N. Introducing a para- 
meter ё by the definition 


pe = — + log М, | Bb = —z, 


and disregarding factors independent of N, we may therefore re-write the formulae . 


(17), (18) in the form given by Gibbs: 


ZN à 
Fy A ек Ў try (fe 29) ем 
сга ы. РЕ, (20) 
x Z(N) eßeN У try (е-ВН) евМ 
N! E 
N 


the parameter ¢ can be identified with the chemical potential. 

In the classical expression (20), the explicit occurrence of the factor {М№!}-1, 
which has the effect of reducing the weight factor Z(N) to generic phases, appears 
here as a natural consequence of the Poisson distribution of the partitions of the 
molecules. It once more illustrates the essential part played by the consideration 
of generic phases, which accounts for the identity of the elementary constituents, 
for the interpretation of the formalism of classical statistics. In fact, the factor in 
question is quite indispensable to secure the fulfilment of the product relation 
W(N, + Ne) = W(N,) W (N,), which must hold between the weight factors corre- 
sponding to a homogeneous system of N, + № molecules and its subdivision into 
two pieces containing N, and N, molecules, respectively. 

In quantum theory, the constellations are introduced from the start by the 
process of symmetrization or antisymmetrization of the state-vectors, and there is 
no need to make any other explicit reference in the formalism to the distinction be- 
tween complexions and constellations. In particular, it is possible to treat number 
variables themselves as quantal operators by expressing them in terms of suitable 
annihilation and creation operators, and to base on such operators a mode of repre- 
sentation of all physical quantities pertaining to systems of identical elements, in 
which this identity finds an immediate expression. 

By considering the variable N in the quantal formula (20) as the eigenvalue of 


d 


The Foundations of Statistical Thermodynamics 29 


an operator (denoted by the same letter), we readily obtain the more compact ех- 
pression 


tr( fe PR + KN) 

= (ен + RN)? (21) 
where the trace is now extended to the full set of eigenvectors covering the whole 
ensemble of configuration spaces for all possible eigenvalues of N. The immense 
advantage of this formula over its classical counterpart lies, however, in its invariance 
with respect to the mode of representation of the operators, also in their dependence 
on the number variables. Thus, if we use the representation just alluded to in terms 
of annihilation and creation operators, we arrive at a very simple treatment of open 
systems, especially of those with negligible interactions between the molecules: 
indeed, the distribution laws of ideal gases of bosons or fermions are then immediate 
consequences of the formula (21). 


$6. Statistics and thermodynamics 


The knowledge of the equilibrium distribution law suffices, in principle, to 
derive all the macroscopic properties of the system in terms of the macroscopic va- 
riables which fix the external conditions. In thermodynamics, the properties of the 
system are similarly concentrated in a single characteristic function of the indepen- 
dent variables chosen to define the state of the system. Thus, for isolated systems, 
for which the energy (or enthalpy) is an independent variable, the characteristic 
function is the entropy; for non-isolated systems of given temperature, it is the 
free energy, which occurs in several variants according to the choice of the non- 
thermal independent variables. The establishment of the link between statistics and 
thermodynamics therefore reduces to finding in each case a statistical interpretation 
of the characteristic function“ It is well-known that the solution of this problem is 
not unique, but that for large systems the ambiguity does not matter: this situation 
has been elucidated in a masterly fashion by Lorentz (1916), who called it the "insen- 
sibilité" of the entropy definition. 

The proof that a given expression built up by means of the statistical distribution 
law may be identified with a characteristic function consists of two parts, correspond- 
ing to the behaviour of the latter function under quasi-static and irreversible transfor- 
mations, respectively. А controlled change by da, of the independent variables a; 
results in a well-defined change dG = 24; da; of the characteristic function G. 
If, on the other hand, a spontaneous transformation brings the system from one 
state of equilibrium to another, one can only establish an inequality G(a’) — G(a)<o 
(say) between the initial and final values of the characteristic function; such ine- 
qualities can all be reduced to the corresponding entropy increase of the isolated 
system including the system investigated and its surroundings. It is well-known from 
Sadi Carnot’s pioneering analysis (Carnot 1824, p. 23, sqq.) that any quasi-static 
transformation can be decomposed into a succession of infinitesimal steps, each of 
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which is either an isothermal or an adiabatic transformation. The latter is idealized 
as a sudden infinitesimal change of some non-thermal variable, after the system 
has been severed from the surroundings, followed by an infinitely long period during 
which the system is allowed to reach the state of equilibrium corresponding to the 
new value of the variable in question. Conversely, all irreversible transformations 
can be reduced to two types: either an adiabatic change in which the periods of recov- 
ery following each infinitesimal step are of finite duration, or a process of exchange 
of heat between bodies of different temperatures. 

The discussion of quasi-static transformations is not so straightforward as it 
looks. Thus, if we are dealing with isolated systems, we have no means of defining 
the temperature except in a purely formal way, as the inverse of the partial derivative 
9S[9 E of the very entropy function S for which we want to establish the expression 
in statistical terms: the suitability of such an expression must therefore be tested 
by the verification of equations of the form 
95/да; 


= 


for the non-thermal variables а;; the A; being directly defined as the ergodic aver- 
ages of the partial derivatives 9H/2a; of the Hamiltonian with respect to these variables. 
It is often overlooked, however, that this verification is by itself not yet sufficient for 
a logically complete proof: it is further required to show that the form of the statis- 
tical distribution law is not modified by quasi-static transformations of the external 
conditions. | 

Let us take up again the two points just made for the more interesting case of 
non-isolated systems of given temperature. For simplicity, we shall only consider 
a closed system; we have then to test the identification 


Е = —0 log Z (22) 


of the free energy F with an expression depending on the weight factor Z of the 
canonical distribution.!! The verification is immediate for isothermal transformations, 
for which the variation dF directly expresses the work done on the system by the 
external forces. Moreover, the formula (22) readily yields for the macroscopic energy 
E, which-is the canonical average of the Hamiltonian H, the expression 


ет (23) 


known as Helmholtz’ formula; we use the notation В = 0-1. For the variation dF = 
eee eee eee 

11 Strictly speaking, in order to make the argument of the logarithm dimensionless, we ought, 
in classical theory, to divide all integrals over phase space, including the function Z, by some arbitrary 
"unit" volume w, for which we may take the volume of a cell of the coarse distribution. In quantum 
theory, the function Z, defined as a trace, is dimensionless if the state vectors used to compute it are 
normalized in the usual way; instead of the arbitrary cell volume c, there appears a factor hf, where f 
is the number of degrees of freedom of the system. | 
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= — Sd corresponding to a quasi-static change of temperature 40, we therefore get 

—S = В(Е— Е), (24) 
which formally coincides with the thermodynamical relation between entropy, free 
energy and energy. From our present point of view, this relation must logically be 
regarded as a statistical definition of the entropy, in harmony with the thermodyna- 
mical formalism. However, this argument tacitly assumes that in discussing a quasi- 
static change of temperature we may treat the equilibrium distribution as a canonical 
one with variable temperature. Since the transformation in question, when reduced 
to the two basic types, involves an adiabatic step, our statistical intrepretation of F 
and S will not be completely justified until the permanence of the canonical distri- 
bution under quasi-static adiabatic transformations is established. 

To this end,!? let us start from a state of the system characterized by a tempera- 
ture @ and the value a, of some non-thermal variable. With the help of the Gibbsian 
notion of "probability exponent," 

n = f (F — Н), (25) 
we can represent the normalized distribution density by expn. Let us imagine that 
at a certain instant the contact with the thermostat is interrupted and the value of 
the non-thermal variable suddenly changed by an infinitesimal amount ба from ag 
to a — ag + да. This operation will not at first modify the microscopic state of 
the system, and the ensuing evolution will be that of the isolated system under the 
external conditions defined by the new parameter value a. The density D(a) of the 
equilibrium distribution eventually reached is accordingly given by the ergodic average 
of the initial one, 


1 


D(a)do = Сов ас „ау Е; | D(a) = 32 Da Dy cay} 
here < fJ denotes the ergodic average of un 


| . : | 
quos = FE, fro do, Pa = st ГАР, о, for г (a) = Е,(а), 


$ | : 
о taken with respect to the new Hamil- 


taken. oven the new energy surface &(@) | topian Н(а) = Х Еда) D 
containing the initial phase. j 


We may first compare this expression for D(a) with the density of the canonical 
distribution of temperature 0, for the new Hamiltonian Н (а). Representing by à 
the corresponding variation of 4, we readily get 


12 The following proof was given for the classical case by L. Rosenfeld (1942); it is here carried 
out for the quantal case also. The discussion of the latter case in J. Broer's thesis (1945), is not quite 
¥ 


satisfactory. Е 2 
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= (Boa) m $ 
D(a) = e» [1 — (OT ia) ji D(a) = е" П Di (ÖN 2E;(a) D, у]. 
j 


For the ergodic average of 67 we may write (dropping now the argument a) 
<ön), = DB, A =A 0a, 

where A and A, denote the canonical and ergodic averages of ©Н|да„ respectively. 

Now, if the system is so large that quadratic effects of energy fluctuations can be 

neglected,!? we may identify the canonical average A with the ergodic average Ag 

taken for the average value E of the energy, and replace the difference Ар — A, 

by the first term of its expansion in powers of E — =. We thus see that 


246 , 
(095, = (E — е) В, DE да 


is nearly proportional to the energy difference Е — =. It can therefore be compen- 
sated by a suitable alteration 68 of the inverse temperature to a value В = By + O8; 
for, by Helmholtz’ formula (23), there corresponds to such a change of temperature 
an average variation 
«91, = (E — e) ôf 

likewise proportional to E — =. In other words, we may describe the final distribu- 
tion D(a), for the new external conditions fixed by the parameter a, as being again 
a canonical distribution, with a temperature slightly different from the initial one. 
The condition required for the compensation connects the change of inverse tem- 
perature öß with the change да of the parameter a, by the relation 


OB + В AE да — 0, 


which indeed coincides with a thermodynamical equation characteristic for quasi- 
static adiabatic transformations. 

Turning to the consideration of irreversible transformations, I may be very 
brief, since this aspect of the theory has been thoroughly elaborated by Gibbs (1902, 
Chapt. XI), and the transposition of the argument to quantum theory effected by 
Klein (1931). For systems of given temperature, the statistical expression (24) for 
the entropy can be described, according to (25), as minus the average probability 
exponent of the canonical distribution. If the external conditions are altered, the 
distribution may take an arbitrary form, but the notion of average probability ex- 
ponent can still be applied to it. If the normalized distribution density is represented 
by exp 7, its average probability exponent is defined, in classical theory, by f ne"dw. 
in quantum theory, by tr(ne”). The whole discussion of irreversible transformations 
from the statistical point of view can be based on some simple inequalities which 
these expressions satisfy on account of the concavity of the exponential curve. 


pe АЁ... эйи» An c 5 9 ie Te de 
13 In the quantal case, we further assume that the energy spectrum is so narrowly spaced that 
we may treat the ergodic average А, as a continuous function of the energy e, as in the classical case. 
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The case of adiabatic transformations is covered by the following lemma: "The 
average probability exponent of a distribution with uniform density in each energy 
shell is not larger than that of any other distribution with the same total probability 
for each shell." The limiting case of equality of the initial and final entropies is ob- 
tained with quasi-static transformations, since, as we have just seen, in each infini- 
tesimal step да of such a transformation the first order variation ôS vanishes. In 
any other case, the successive steps will produce irregular variations of the average 
probability exponent until the establishment, with the last step, of an equilibrium 
distribution of uniform density in each energy shell; the lemma then tells us that 
the corresponding entropy is larger than that of the initial state. 

To cope with transformations involving heat exchange between different parts 
of the system, we have a second lemma: "If a system is divided into two parts, the 
sum of the average probability exponents of these parts is not larger than the average 
probability exponent of the total system. The equality holds if, and only if, the two 
parts are simply juxtaposed without interaction." The two preceding lemmas are 
logically sufficient to exhaust all forms of irreversible thermodynamical transforma- 
tions. If, however, one wants a more detailed analysis of the role of the temperature 
'of the bodies involved in exchange of heat, a third lemma proves useful: "If 0 is 
a positive parameter, the sum of the average energy and 0 times the average prob- 
ability exponent isa minimum for the canonical distribution of temperature 0 (for 
which the quantity in question represents the free energy).” By combining the second 
and third lemmas one readily derives the fundamental theorems of Carnot ‘and Clau- 
sius about cycles of transformations in which a body exchanges heat with various 
reservoirs. | 

I shall not conclude this survey of the logical structure of statistical thermody- 
namics without some discussion of the novel type of irreversible process with which 
we are confronted in quantum theory, viz. the process of measurement. Strictly 
speaking, the consideration of this kind of irreversibility falls outside the scope of 
thermodynamics, since it is connected with (ideal) processes affecting single atomistic 
elements. Still, it is instructive to compare the two situations, which, while referring 
to entirely different problems of observation, exhibit a striking similarity in their 
statistical aspect. The feature of irreversibility involved in the measuring process is 
intimately related to the peculiar wholeness” of quantal phenomena, which is their 
most fundamental character; indeed, a quantal phenomenon is only well-defined 
when it is brought to completion by some measuring intervention of the macroscopic 
observer. Just as in thermodynamics, though of course at a deeper level of observa- 
tion, such. an intervention introduces an irreversible element into the observer's 
experience. 

In a measurement performed on a quantal system, the interaction between the 
system and the macroscopic bodies serving as measuring instruments partly escapes 
the control of the experimenter; this has consequences similar to those of the averaging 
process implied in the definition of macroscopic quantities. Even if the system was 
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originally in a well-defined state, its state after the interaction with the measuring 
apparatus will be a statistical mixture of all the eigenstates of the quantity measured: 
the density operator D,, which contains, so to speak, all the statistical potentialities 
inherent in the "pure" quantal state y, is transformed into an operator of the general 


form 


DE x tr (D, D,;) Dy, (26) 


in which one of these potentialities is actually expressed. One may look at this situ- 
ation from the point of view of the theory of information: the measurement entails 
the loss of the information about the previous state of the system; to regain an equiv- 
alent amount of information, one has to perform a "reading" of the measuring 
instrument, i.e. sort out the mixture described by the distribution (26), and this costs 
a certain minimum amount of physical entropy. 

It is interesting to note that Gibbs’ conception of the average probability expo- 
nent is adequate also in this case to give the measure (apart from the sign) of the 
entropy associated in the way just indicated with the process of the measurement. 
In fact, v. Neumann (1927, p. 273; see also 1932, p. 191 sqq.) has shown that this 
entropy is connected with the distribution density (26) by the relation 


S = —tr (D log D). (27) 


According to this formula, all pure states have the entropy zero, while for a mixture 
of pure states like (26), with "concentrations" л, = tr (D, De, one gets the familiar 
expression for the entropy of mixing, — 27; log л;. The justification of the defini- 


Li 

tion (27) requires in the first place the proof that it is possible to transform any pure 
state into any other by quasistatic adiabatic operations involving suitable measure- 
ments, but no sorting out; a sequence of such operations yielding the desired result 
has been most ingeniously contrived by v. Neumann. Once this statistical equivalence 
of all pure states is established, the case of mixtures is easily understood. Let us start 
from an ensemble of N systems all in the state y, and subdivide it into subsets cont- 
aining zr, N25... systems, respectively; we. can then transform the systems of 
the first subset into the state фу, those of the second into the state pg, etc. without 
any change of entropy: all that remains to be done is then to mix up all these sub- 
sets, an operation whose minimum cost in entropy is just given by the usual expres- 
sion quoted above. 

This argument clearly shows that the relation (27) exhibits more than a purely 
formal analogy between the statistical features of two different orders of phenomena: 
it expresses the necessary connexion always to be expected between the abstract 
appraisal of the information obtainable about such phenomena and the thermodyna- 
mical properties of the macroscopic physical processes by which the information 
may in principle be obtained. 
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$7. Dialectical aspects of statistical thermodynamics 


Looking back on the preceding analysis of the atomistic theory of heat, I should 
like, in conclusion, to stress once more the epistemological principles which find 
such a striking illustration in the development of this theory. Above all, we realize 
that only the patterns of dialectical logic are flexible enough to formulate adequately 
the peculiar relationship’ between the atomistic picture and the macroscopic laws 
it purports to explain. Any attempt to follow rigidly a line of deduction exclusively 
based on atomistic conceptions, without paying due regard to the limitations imposed 
upon such deductions by the requirements of the macroscopic mode of description, 
inevitably leads to the impasse characterized by the famous paradoxes of Loschmidt 
and Zermelo!?: the behaviour of atomistic systems does not exhibit any irreversibility. 
The second law of thermodynamics therefore appears, if one attributes it absolute 
validity, as the very negation of atomic theory. Such a dilemma cannot be resolved 
by any superficial patching up of the antithetic conceptions, but only by the intro- 
duction of a radically new point of view, from which these conceptions can be indis- 
solubly integrated into a true synthesis. The unfolding of this dialectical process 
can be clearly discerned in the historical development of the subject. It was actually 
Clausius himself, the discoverer of the second law, who gave the first impulse to 
a renewed analysis on atomistic lines of the thermal properties of matter; and this 
re-assertion of atomic theory found its completion in the emergence of the statistical 
viewpoint, which confers it its new synthetic character. The protagonists of this dia- 
lectical drama were blissfully ignorant of the arcana of hegelian logic, but they keenly 
realized the novelty of the momentous step they had taken: "The modern atomists, 
wrote Maxwell (1873, p. 373) have —adopted a method which is, I believe, new in the de- 
partment of mathematical physics, though it has long been in use in the section 
of Statistics.” 

Yet they were not aware of any break in the developmentof the theory of heat: for 
the basic idea which inspired the pioneers of thermodynamics (with the solitary exception 
of Robert Mayer) had always been the atomistic conception of heat. They thus experienced 
the dialectical transformation of thermodynamics into a statistical theory as a perfectly 
natural evolution within the general framework of atomism. Only at a late stage of 
the discussion were the laws of thermodynamics played up against atomic theory, 
when an attempt was made to revive a pluralistic conception of continuous matter, 
reminiscent of Anaxagoras’ ”homoiomeria,” and involving, like the latter, an idealistic 
view of the "cosmos." Mach (1871), in particular, the most consistent advocate of 
such a conception, developed on the model of the logical structure of macroscopic 
thermodynamics a formal idea of causality, according to which the causal connexions 
of physical phenomena would be exhaustively described by functional relationships 
between macroscopic quantities; he opposed this type of direct causal relations out- 


Rt 


14 See P. and T. Ehrenfest (1902), рр. 22—23. 
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side of space and time to the determinism of classical mechanics, which was then 
thought to apply also to atomic phenomena. Statistical thermodynamics, however, 
introduces a new form of causality, which transcends the two others and eliminates 
their apparent opposition. It is remarkable that in quantum theory statistical causality 
again appears in a similar synthetic role, as part of a mode of description embodying 
the complementary aspects of space-time coordination and energy-momentum con- 
servation. 

I have already pointed out that in statistical thermodynamics just as in quantum 
theory, the notion of complementarity adequately expresses the logical relationship 
between the mutually exclusive modes of description which are synthesized in the 
theory. The existence of a relation of this kind did not escape the sagacity of Max- 
well (1879), when he remarked, for instance, that ”аз soon as we begin to see the 
molecules distinctly, heat becomes resolved into motion.” From this point of view, 
phenomena such as the Brownian movement appear in an interesting light. They 
represent intermediate situations, in which the two complementary modes of descrip- 
tion can be applied with reciprocal latitudes. Thus, if we observe an emulsion of very 
fine droplets through the microscope, we may partially follow the motion of an in- 
dividual droplet, as well as count the average numbers of droplets in thin layers at 
various depths: the individual motion will deviate by irregular fluctuations from the 
course determined by gravity and viscosity, and its character will change from com- 
plete randomness to complete dynamical regularity as the size of the droplet in- 
creases; the variation of droplet density with depth, on the other hand, which for ultra- 
microscopic droplets is accurately characterized by a canonical distribution of definite 
temperature, will deviate more and more from this simple form with increasing droplet 
size. Examples like this illustrate the essential point, repeatedly emphasized in the 
preceding discussion, that the mutual limitation of the complementary modes of 
description is a direct consequence of the difference of scale between the physical 
systems investigated and the tools of human observation. | 

In this respect, the particularly clear and cogent situation met with in quantum 
theory and so acutely elucidated by Niels Bohr has been of paramount importance 
in helping us to realize the insufficiency of current ideas about the relations between 
phenomena and their observation. For the idealist (I am only speaking of the most 
moderate variants of this attitude), scientific concepts are "free creations of the mind” 
projected onto the physical world; for the mechanistic materialist, they are the ”те- 
flexions” in the mind of external events. Common to both conceptions is the assump- 
Чоп that the interaction between the material world and the mind does not appre- 
ciably disturb the course of their separate existences. We have learned, however, 
that the peculiar wholeness” of atomic processes, expressed by the law of the existence 
of a quantum of action, makes such a separation of atomic systems from the condi- 
Hons of their observation impossible: a quantal phenomenon can only be defined in 
terms of these conditions. The occurrence of complementary quantal phenomena, 
as well as the statistical character of the predictions which can be made about results 
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of observations on quantal systems, have their origin in this fundamental fact. We 
are therefore now in a much more unprejudiced position to re-examine the problem 
of observation in statistical thermodynamics. 

The contrast between the macroscopic mode of description of the thermal prop- 
erties of matter and the dynamical analysis of the underlying atomic motions found 
a picturesque, but not altogether harmless, expression in Maxwell’s (1871, Chapt. 
XXII in fin.) fiction of "demons" with exquisitely refined senses, enabling them 
to perceive individual atoms and act upon them. Like the all-embracing "intelli- 
gence" in which Laplace had symbolized the deterministic ideal of mechanistic ma- 
terialism, Maxwell (1873, p. 374) credited his demons with a more perfect kind of know- 
ledge than that attainable by human observation: "when we pass from the contem- 
plation of our experiments to that of the molecules themselves, we leave the world 
of chance and change, and eriter a region where everything is certain and immutable." 
The statistical approach was represented as a pis-aller, and the macroscopic laws 
deduced by such means could not "pretend to absolute precision." Once it is recog- 
nized, however, that statistical relationship can logically subsist on their own right 
and provide, without any "deterministic substratum," a consistent and exhaustive 
account of a field of experience, there is no longer any point in asserting with Maxwell 
that while such relations may be true "for all practical purposes," they lack the final- 
ity attributed to "the laws of abstract dynamics"; or in imagining like the less sub- 
tleminded epigones, that atomic theory "reduces" — if not in practice, yet "jn prin- 
ciple" — the account of all experience to its atomistic aspect. Indeed we are here 
dealing with a typical situation in which any such distinction between "practice" 
and "principle" utterly vanishes: for the decisive role of the conditions of obser- 
vation in determining the mode of description of experience is just as much a matter 
of "principle" as of "practice." — 

This point was already appreciated very clearly by Clausius (1879, pp. 315—316), 
whose robust realism rebelled against the metaphysical misuse of Maxwell’s demons: 
he acutely pointed out that the statistical description of the kind of molecular motion 
which we call heat is a matter of definition, directly arising from the actual im- 
possibility of discerning individual atomic processes with the usual means of obser- 
vation; the intervention of demons destroys the very conditions which give molecular 
motion the peculiar character to which the concept of heat refers. Only the undisputed 
acceptance in his time of the mechanistic dogma of determinism prevented him 
from fathoming the full epistemological significance of the statistical theory of heat, 
as we are able to assess it, now that we have come to realize the essential autonomy 
of statistical causality. 

As rightly stressed by Clausius, the explicit consideration of the human observer 
in framing the concepts and methods of thermodynamics is a logical requirement 
for a proper definition of these concepts. Such an abstract outlook, however, may 
easily mislead people muddled by some idealistic philosophy to the belief that a sub- 
jective element is thereby introduced into the theory. Thus, no less an authority than 
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С. М. Lewis (1930) could make the extraordinary statement that entropy "corresponds 
merely to a loss of information with regard to a state of the system and is thus a purely 
subjective concept." Clausius had a healthier view of the matter: he perfectly under- 
stood that, quite on the contrary, we are here dealing with an essential condition of 
objective description, imposed upon us by the laws of nature which govern the 
process of observation itself; entropy, he tersely remarked, "is not concerned with 
what heat can do with the help of demons, but with what it can do by itself." It is 
indeed remarkable that the development of the atomistic conception of the physical 
world has so insistently, in the theory of heat and again in quantum theory, reminded 
the physicists that science is not a detached contemplation of Nature, but is itself 
a natural process in which man is actively engaged. Our conception of the structure 
of scientific theories is thus more concretely brought into harmony with the social 
function of science. The founders of thermodynamics had no doubt that the task 
of science is — as Marx then expressed it with youthful enthusiasm — "to change 
the world." Deeper insight into the meaning of scientific knowledge, gained from 
their achievement, gives a new impetus to the idea of human progress which inspired 
them in their endeavour. 


КРАТКОЕ СОДЕРЖАНИЕ 


Розенфельд, Об основании статистической термодинамики. 


Целью этого очерка является исследование логической структуры статисти- 
ческой термодинамики и отчет о том, верно ли эта теория отражает своеобразный. 
диалектический характер физической ситуации. Автор попробует описать в об- 
щих чертах, очень кратко, основания статистической интерпретации законов 
термодинамики, подчеркивая ‘при этом сильно физическое значение формальных 
аргументов. Взгляды, которые автор хочет изложить, являются несомненно, зна- 
комыми для тех физиков, которые внимательно познакомились с сущностью 
дела; однако существует еще среди части из них широко распространенная пута- 
ница, которая может послужить оправданием для рассмотрения вопроса с синте- 
тической точки зрения. В связи с этим, дискуссия классического подхода наравне 
с квантовым не будет целиком излишней: настоящее значение квантового под- 
хода можно получить контрастируя его с классическим. 
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This work is devoted to a study of linear unitary transformation which can 
be applied to coordinate and momentum operators. Since these operators 
behave like two components of a covariant spinor with respect to unimodular 
transformations, a formalism is developed (§1) and a notation is introduced analogous 
to that used in the spinor calculus. A linear group of transformations t and a group of 
linear homogeneous transformations v; are introduced in $2. In 83 representations of 
unitary operators are introduced leading to given transformations from тр. А discussion 
of the use of general unitary operators of the form eT is given, T being a quadratic form 
in the momentum and coordinate operators, which lead to certain transformations from т. 
In $5 an explicit form for the operator eT is formed leading to a given transformation 
of v. In $6 a transformation group induced by тр is discussed for three real operators 
which are the quadratic forms in coordinate and momentum operators (physically 
identical with the Hamiltonian, the Lagrangian and the action fuuction of a harmonic 
oscillator respectively). This group proves to be identical with L, i.e. with the restricted 
Lorentz group acting in a threedimensional Minkowski space. In 87 the possibility of 
reduction of quadratic forms in coordinates and momenta to each of the three pos- 
sible canonical forms, effected by means of linear unitary transformations, is discussed. 
In 88, avoiding the problem of an explicit construction of eiT leading to a given 
transformation from т, a simple method is given for finding the eigenvalues of an oper- 
ator which is a general elliptic quadratic form in the coordinates and momenta. Finally 
in $9 by specializing the general theory of the preceeding sections a detailed discussion 
of a number of particular unitary transformations and the corresponding unitary operators 
is given; at the same time the rules for applying these operators to state vectors are 
deduced. The formulae of this section are useful computations directly performed on 
operators appearing in various quantum mechanical problems. ‘In the last (10-th) 
section the question of composing linear transformations and the corresponding unitary 


operators is discussed. 


Introduction 


One often encounters in the quantum theory problems in which the Hamiltonian 
(or other operators considered) are quadratic forms of the coordinate and momentum 
operators. Linear transformations transform given quadratic forms into other quadratic 
forms particularly those which have more convenient properties for practical 
computation. The purpose of this paper is a detailed investigation of that class of 
unitary transformations "which transform coordinates and momenta into linear 


(ш) 


49 L. Infeld, J. Plebanski 


functions of the original ones. The basic problem considered in this work is the fol- 
lowing: Given the real operators é} and £, satisfying the relation [Ei E] = 6351 = 
— é} =i invariant with respect to the unimodular transformations 


a 
& = у hfs + tai а = 1,2; Det || | = 1. 
B=1 
We look for operators eT which induce this transformation, i.e. which satisfy the 
condition e7f/(£) eT = f(E). It turns out that the complete answer to this question 
requires the application of a certain sort of "real" spinor calculus (i.e. without distinc- 
tion between the dotted and undotted spinor indices) as long as £, and &, are real. 
(In a forthcoming paper we intend to generalize these considerations to complex oper- 
ators). 

Solving the above problem, we get among others a number of simple formulae 
containing special unitary operators and certain new operator indentities. These 
formulae are of some use in certain computations performed on operators in 
quantum mechanics, since by making use of them one can in a simple way make 
use of the linear unitary transformations. 

The problem considered in this paper seems to us interesting not only from 
a purely mathematical point of view. We think that its solution opens certain possibil- 
ities for physical applications. Indeed this work arose from a physical problem put 
forward by one of us (J.P.) concerning the construction and discussion of generalized 
Kennard packets for the harmonic oscillator and the quantized free field. The detailed 
investigation of their classical properties (Plebanski 1954) may be of importance 
for the interpretation of quantum theory. 

We expect that after generalizing the formalism of this paper to complex opera- 
tors further applications of the mathematical methods to quantum electrodynamics 
and other domains in which spinor operators appear may be found. 

In this paper we shall use Dirac’s (1947) notation 


$1. Notations. Index quantities and their properties 


Let p and x be momentum and coordinate operators of a system of one degree 
of freedom? satisfying 


[р, x] = —ih 
If а has the dimension of length and b of momentum and if a:b =h, then 


61 = xja, &, = р[Ь are dimensionless real operators satisfying the commutation 
df df : 
relations 


GUAPA a) 
df Р 


Асов: него 
1 We consider for simplicity the one-dimensional case. Most of the considerations of this paper 
may in a natural way be generalized to the case of more degrees of freedom, 
* In the sequel free Greek indices run from 1 to 2, 


Certain Class of Unitary Transformations 


> 
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The present work is devoted to the study of linear unitary transformations which 
may be applied to the operators £, and the study of the application of the correspond- 
ing unitary operators to state vectors. 

We shall prove that operators £, behave with respect to linear unitary transfor- 
mations like the two components of a covariant spinor with respect to unimodular 
transformations. Therefore, it will be convenient to define ”contravariant’’ operators 
&% as wellas a number of other. quantities connected with the operators 6, asthey 
are in spinor calculus (Infeld and van der Waerden 1933, cf. also Bade and 
Jehle 1953). Sinee in this paper we consider only real operators, the "spinor" 
formalism is somewhat simplified. In particular it is unnecessary to distinguish 
between dotted and undotted indices. 

'The introduction of spinor-like quantities enables us to write down most of the 
formulae in a concise and symmetric form, and to use a convenient approach 
to certain group- theoretical facts connected with linear unitary transformations. 

We define contravariant operators é^ as follows: 


1 = £,, 2 = —£,, (1j 
df df 
If we define the matrices Yag and y? by 
0, A p . 
Е Ls (1.3) 
а" (to df 


and apply the Einstein convention concerning summation over dummy indices 
we easily see that 


y yg, = OF (1.3a) 
and that, due to (1.2), 


Eta уш, Ё, = ё yg (1.4) 


Constructing further quantities with Greek indices (С — numbers or operators) 
we assume that consistently with (1.4), one may rise or lower the indices, according 


to the following rules 
Vim — & ae vct AD 7 LIE UN _= A= a’ C Уо (1 .5) 


—.— hs 


It can easily be verified, that in accordance with (1.2) and (1.3) the commutation 
relations (1.1) may now be written in the form 


Lene questa E = GARE ик (1.6) 
or 
ады cii (1.63) 


Operators &, therefore fona a spinor for which &, £*24 0. Obviously for 
numbers y, connected with y^ by (1.3) one always has : 


f. Yay = 0 =Y Ya 
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Let us now construct the symmetric quantity 


ly Suv 221, % (6155 + 8) (1.7) 
*aB — 9 ($ S38 S8 Sa и (EE de 2), 2,2 
£2 E E t 
ха8 = 1. (ёа 28-1 EB fo) = $2 ГА (5152 z 2) (1.7а) 
2 — % $ 5551) © 1 


and the ”mixed’ matrices 


в Го Ион, = 55 
ee EUN: (1.8) 
p age <2 LG) (m CN] " 


For (1.8) to be consistent with rules (1.5) one has to assume that the Latin 
indices running from 0 to 2 may be raised or lowered by means of the symmetric mat- 
rices 


gN oe 0, = 1, 0 = 8“ (1.9) 


Опе can easily verify that the matrices с defined by (1.8) and symmetric in the 
Greek indices have the following properties 


1 1 Е р 
TED 9 (1.103) быв" = (s 82 + дв x) (1.10b) 


4 
gka g^, + січе gh; = E gt! др (1.10c) 
E 1 
oa aL er g 5e (1.104) 


where £p is equal to +1 for an even permutation of 0, 1, 2 corresponding to Ё, l, п, 
to —1 in the case of an odd permutation, and to 0 in all other cases. 
By means of the o-matrices let us now construct from хр the "Latin" vector 


Et ал 


X* — oF xy = ры (824-8), EE yet) (lie) 


In virtue of (1.10b) one easily sees that by means of X, one gets from a given с 
the х.з again, namely 


у, x96 = or ль, % ав == Okap yE (1.115) 


Obviously the three components of the "vector" X,are real operators. 
In virtue of definition (1.11) and using (1.10) and (1.6) one can easily show 
that the operators X, have the following properties: 


| ХХК = X4 — X1 — Xi yp rtm | (1.12) 
[Х.Х ме Oe (1.124) B ТЕХ, (1.12Ъ) 
[X 8) 24 Mr, (1.120) [Ж Ed = 2i0 & (1.124) 
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It can easily be verified that with £, c-numbers and with »,, and X, still defined 
by (1.11) one gets X, X* = 0. We can also note that one of the relations (1.13) follow- 
ing from (1.1) may be considered as a definition of the matrix o if one assumes 


"a priori" the form of X, as expressed by £, and &, 


$2. The Group of Linear Unitary Transformations 


The definitions and relations (1.12) and (1.13) used so far are of a purely alge 
braic character. The purpose of these definitions and of the use of the "spinor" no- 
tation should become clear as soon as we turn to our main problem, which is 


a study of linear unitary transformations in a space of non-commuting operators £,. 


For a linear transformation of the operators £, given by 


f = thE + te (2.1) 


to be a unitary transformation, the numbers £, and t, must be (i) real (а unitary 
transformation transforms real operators into real operators) and (ii) the numbers ЕР 
must satisfy the relation 


| f | л и (2.1a) 


This is due to the fact that according to the definition of unitary transformations 
the commutation relations і.е. [£,, £,] = i must be invariant with respect to these 
transformations. 

Transformations (2.1) with the condition (2.1a) form a 5-parameter group of 
linear unitary transformations denoted in the sequel by v; ( composition of two trans- 
formations gives an element of this group again, for each element of the group there 
exists an inverse on account of (2.la), there exists a unit element ie. the identical 
transformation t,? = 6%, t, = 0). 

Evidently the group of linear homogeneous unitary transformations 


E = tes (2.2) | г? | = 1 (2.2а) 


forms а subgroup of т. We shall denote this subgroup by ту. 

The transformations from the group t may easily be given a geometrical inter- 
pretation. Forgetting for a moment that the £, are operators, let us treat (£,, 2) as real 
numbers representing the coordinates of a point in a plane. In this plane consider 
the area of the parallelogram defined by two vectors. If the plane is transformed 
by means. of (9.1) then, on account of (2.1а), the area is an invariant. Let us consider, 
therefore, such linear transformations in the space (81, E), physically identical with 
the phase space, which conserve the "volume." 

Thus if £, were c-numbers, we would have to do with a special case of canonical 


transformations. 
Let us also write the transformation law for £^ induced by (2.1) in the form 


p art +r l (2.3) 
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where 
7 = — = ya Yat = ym ty (2.3a) 
On account of the invariance of & 2“ = i with respect to a linear unitary trans- 
formation, we can easily see that by aplying (2.1) and (2.3) jointly, the matrix tj 
is ап inverse with respect to tg Le. 


161,9 = др (2.3Ъ) 


We shall add an obvious remark that if one adopts the transformation law 
for Greek indices consistently with (2.1) and (2.3), the scalar products of the type 
y®r, will turn out to be invariant with respect to the transformations considered 
independently of whether y^ and g, are c-numbers or operators. 


$3. Representation of unitary operators leading to linear unitary transformations 

We now turn to the question of representing unitary operators which lead to 
linear homogeneous unitary transformations. Let a unitary operator U have the 
property l 

UE, U* = £ it E : (3.1) 
where the numbers £ satisfy the condition | é4, | = 1. 

Relation (3.1) determines the representation of U and, therefore, U itself but 
for a phase factor. Indeed let | £> be the eigenkets (£,| &,> = &,| £,») of the opera- 
tors £, normalized to ó-functions 

5158» = ê (fı — 5). 150) = IH) «| E» = её /2л). _ 

On account of [£,, £,] =i, we have therefore £j = —i 9/9€,, so that if (3.1) 


is rewritten as UE, = tf £, pec Bolo bn dp eT TW 
Sy. &) = <E, |U] 4D, we obtain the result that 5 satisfies T: 


As — me 
SEY (ws. &- 1 ta) © E 
rhs و‎ Б » s ge^ idm а т Fe ә \ y 


fa 


wer en nt} 
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IRA pr 


performing some appropriate Fourier integrations ((E, | £,» = ), compute 


all expressions of the form Cé, | U | &,>. As a result of this computation, we get finally 
Е егч і P , б. 
1 1 
ТТ ейи ME is ы 
al EO = ҮЗ | ° exp 2:2 (25, E - t £7 Е 28 t? £7) (3.4b] 
2 
iA f 
" "m CM =, „ , Ре 
<2.10|;> = V3ajrii і REE (— 24, $1 + tiéz —1, 87) (3.46) 
1 
А 7 ей» 1 Be oe ; 2 
т Ро (ени МЕЗ} а 
2 


where the phases Аш аге related to each other in the following way 
P 2512, 7t ; 1 
À12 3 Ai siti sgn bo ty px , Agı = А11 xr Т. sgn tit 


Aaa = 211 — (sgn 2,2112 + sgn tt) T (3.4е) 

Relations (3.4) are valid, of course, on the assumption that all tf are different 
from zero. If any of the t? vanishes (since |t| = 1 at most two of the numbers 
1? may vanish simultaneously), then from at least two completely determined repre- 
sentations out of (3.4a—3.4d) one should go over to the remaining by means of a Fou- 
rier transformation. In consequence one then arrives at а ö-function with a linear 
combination of £, and £, as argument multiplied by a certain factor depending on E. 
Expressions of this type will be discussed in some detail in $9 where some special 
unitary operators will be considered. 

As far as the representation of the unitary operators connected with general 
non-homogeneous linear unitary transformations are considered, we can arrive at 
these representations by applying a similar method to that used already in the case 
of homogeneous transformations (they will have a form analogous to the represen- 
tations given by (3.4) with terms linear in Е, and £j also appearing in the exponent). 
Knowing the representations of a unitary operator connected with the translation 


E, = &, + t, and the relations (3.4) (being already computed), we can get the above 
mentioned representations by computing some simple Fourier integrals. 


84. Unitary operators connected with linear unitary transformations 


In this section we shall give unitary operators, constructed out of £,, leading 
to linear unitary transformations. It is well-known that every unitary operator may 
be written in the form еТ, where T is a certain real operator. We are interested 
in unitary operators U = eT connected with linear unitary transformations and, 
therefore, such that eiTg e iT can be expressed linearly in terms of £5. 
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On account of the well-known formula 


ehe = » i"/n! [a, [а, [2.. [a, 5]... Ш (4.1) 

valid for arbitrary operators a and b, it is clear that the above operators eiT where T 

is a quadratic form in the operators &, with real coefficients, will possess the 

required property. Indeed, the commutator of T and £, may then be expressed line- 

arly in £5 so that the sum of multiple commutations of T with &, can also be expressed 
linearly in £5. 

We are therefore going to consider unitary operators of the form U = e'f, 


where 
1 1 1 А А 
Us Бу ат 5 а 5 al? (18. + Eat) + 818, + $78, (4.2a) 


all, a22, а12, £1, £? being real numbers? 
It can easily be seen that assuming a?! = a!?, i.e. postulating the symmetry 
of a, T may be expressed, according to (1.7), by 


i j 
Tu * a? хав +. ET Eu. (4.2b) 


According to (1.11b), we have х, = 2055 X; Therefore, if we introduce the pa- 
rameters | 


| aho Abos (4.2c) 
df 
by means of which а% may be expressed by 
а = 20% a, (4.2d) 
then our quadratic form in the &, can be written as 
Т аб X, + EE. (4.2е) 


Obviously it is immaterial whether a ог a% are considered as originally given para- 
meters of the form Т; depending on which is more convenient we shall use, therefore, 
a* or a® respectively. 

Let us now calculate e'7£,e-*7, where Т is given by (4.2b) or (4.2e). The calcu- 
lation may be performed in two equivalent ways: 

1) Putting £,(4) df e?T£ae-^T with T written as (4.2a), we get by differentiating 
and computing [T, £,] a system of differential equations for &(4). By formal inte- 
gration of these equations with the initial condition &,(0) = &,, and by subsequently 
putting A = l in the computed integrals, we get the quantities required. 

2) With T given by (4.2e), making use of (4.1) and computing the multiple 
commutators by means of (1.11b) and (1.10), we get the Eis quantities by sum- 
ming up the series thus obtained. 

————— CES a ERRANT E e uud. IST. 

Б produce 216, and ££, in T should be understood as && + &£, because of the reality of T. 
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Аз a result of the simpler method (1) we get formulae which may by written 
either by means of the a“ or the a^ parameters, as follows 


ge exe = or put Ув) st Va = thee zz Lo (4.3a) 
where 

tê = cos wôf — sin w/w a;? = cos 08 — 2 sin w/w at Qu (4.3b) 

df Я 

1122 1 1 ^ 
w A [022422 — (а1?)?]% = (+ ага) = (a, а^)*. (4.3c) 
e 1/02 - dapë? = — 2wo ag а в (4.3d) 
1 (эт w/2\ © sino о sino/2\” 9, sin © 
= B В — — Е EB 8 

to Be ( "p, ) Gap ё? + © УавёВ = ( "mp, Gaga" £P +- = Уавё? (4.3e) 


The quantities у, defined by (4.34) appeared in (4.3a) as result of the reduction 
of constant terms in the linear differential equations which are integrated by me- 
thod (1). 

Obviously they have only a well defined meaning if о 0 (cf. eq. (4.3e)). Quan- 
tities ¢, defined by (4.3e) also have a meaning when œ = 0. (perform in (4.3e) the 
transition œ — 0). If о is imaginary, (4.3b) may also be expressed in terms of hy- 
perbolic functions. а Э 

Equations (4.3) are valid for arbitrary values of the parameters а, “(ог a*, £*). 
In our considerations these parameters are real numbers. As a consequence of the 
procedure (particularly clearly іп’ case of method (2) based on (4.1)) they are also 
valid for complex а%, ёо, In this case the operator е Г 
We see, therefore, that the unitary operator eT, where T is a quadratic form 


is not unitary. 


in £, containing 5 real parameters, leads to a particular linear unitary transformation 
iow m + t,, where tf and t, are defined by (4.3b) and (4.3e) (it can easily be seen 
that || = 1). The class of linear unitary transformations which may be produced 
by such operators e'f is subject, however, to a certain limitation. Taking namely 
the trace of t, from (4.3b), we find that 


1“ = 20890. (4.4) 


Since a” are real the w from (4.3e) may either be real or purely imaginary, so that 
according to (4.4) we have > — 2. By means of the operators eT only linear 
unitary transformations of trace >> —2 are possible. — . 

Let us clarify the réle of y, appearing in (4.3a). In virtue of (4.2e), (4.3a) and 
(4.3b), (4.3e) with arbitraty у, we have 


oa p, et te = Ey + y, (4.5a) 
Let now w? = аћа, # 0; taking 


Ai hs eye SAT SER ee (4.5b) 
i df E ; 
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o 


we can easily show that according to (4.5a), (4.5Ъ), (1.10b) with arbitrary a LIS 


we have 
o 


е^ йа [ak X, + ЕЕ] et a = a*X, Т, (4.5c) 


o 
where the number T is given by 
o o 9 zB 
T == Іо? ал X; = C рав ETE (4.5d) 
df df 
Formulae (4.5) give a simple prescription for reducing, by means of a unitary 
transformation, the linear terms appearing in an arbitrary quadratic form in the 
operators &,. It is clear that such reduction is only possible if y, are well defined, 
that is when а^ а, 75 0, which means that the quadratic form is not "parabolic." 
With the help of (4.5) it may easily be seen that 


Qo Xr PED а ля (4.6) 


o 
provided a* a, Æ 0. Here y, and Tare taken from (4.5b) and (4.54). The transformation 
o 
corresponding to the operator exp i(a* X, + £* &,) is, therefore, a superposition of 
three unitary transformations: translations by the amount of y,, homogeneous trans- 


formations Е, = t? E, and finally a translation by the amount of —y,. This explains 
the way in which у, appears in (4.33). 


85. The explicit form of a unitary operator giving rise to a given linear unitary 
transformation 


Consider the following problem: given the numbers £P, t, with |22 | =1 
Defining Е, = £g + tq, we wish to construct out of £, such a unitary operator U 
that UE, U* Е, 

The last condition leaves a constant phase factor of U undetermined. This ambi- 
guity is the only one left in U. If Ut and Ug) have the property 


Day & Са = Ea = Ша, E, Un 


then Од) and Ug, can differ only by a constant phase factor. 


Indeed if f is an arbitrary function of &,, f = f (£1, £5), then according to this 
assumption 


Uf (5, £5) Uo = (Е, ©) = Uy f (&1, £9) U a» 


From the last equality it follows, however, that Uc Ua; commutes with an arbi- 
trary function of &,, so that it must be a number. Obviously this number which re- 
sults from the composition of two unitary operators must be of the form ей, Im(A) = 0. 

Formulae (4.3) allow us to solve the above problem of an explicit construction of 
a unitary operator corresponding to a given linear unitary transformation. 
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Schliesslich stimmen die berechneten Werte mit den experimentei- 
len gut, aber nicht vollständig überein, weil die so weit gehende Verein- 
fachung der Zustandsgleichung im allgemeinen eine merkliche Verände- 
rung der Differentialkoeffizienten verursacht. 


. Herrn Professor Dr. Witold Jacyna sprechen wir für seine 
wertvolle Anregung und Hilfe bei der Ausführung dieser Arbeit unseren 
herzlichsten Dank aus. 

Leningrad, November 1934. 
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The above statement yields a simple prescription for constructing a unitary 
operator that leads to a given linear unitary transformation satisfying the condition 
> —2. The question of ambiguity in choosing a particular w from (5.4e) will be 
отав in more detail іп $ 8 after we introduce convenient means for discussing it; 
then we shall also consider the case £j < —2. 

The form T in (5.6) is in a certain sense connected with the transformation 
(5.4a). In particular it possesses the following property: if we substitute therein 
£, t, ВЕ Fare it goes over into itself. 

Sich a property cannot be used, however, as a starting point for finding T for 
the transformation since it is satisfied by all quadratic forms in the operators £, pro- 
portional to T. 


6 &. The transformation laws of the quantities X, 
We shall now investigate what kind of transformations apply to the quantities 
X, defined by (1.11) with respect to linear homogeneous unitary operators $,, or 


which group of transformations of X, is induced by т, defined in $2. 
If U is a unitary operator such that 


f= Ug, Ui == tf Ep, | te | =l (6.1) 
then, in consequence of CDi (1.11) and (1.11b), it is easily seen that 
= UX, U* IX (6.2а) 
7 
where 
ee One (6.2b) 
We remark that, on account of (1.10a), the transformation £f = ôf as 
well as 12 = — 02 (6.2b) corresponds to the identity transformation of the X's, 
n = д” 
Е ke 


The group of the linear homogeneous unitary transformations of the operators 
X,, ie. the group of the linear homogeneous transformations preserving (1.12) аз 
well as the commutation relations of the X's (1.12a) will be denoted by £}. 

It is easily seen that the transformation matrices from the group £3, L7 are 
identical with the matrices of the Lorentz transformation with determinant +1 pre- 
serving the direction of time and operating in the 3-dimensional Minkowski space. 

Indeed, introducing (6.2a) into XX —3/4 taking account of (1.12a), we may 
easily find (in consequence of X,X*= mcn that 


g^ LE Dy = в" RR (6.32) 


Furthermore, introducing (6.2a) into (the transformed) relation (1.12a), and 


using the fact that L = g"" gL, is the reciprocal of L7 (in agreement with (6.3a)), 
we get pally 


topo Ге 1; 1 = ёсь» | (6.3b) 
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The meaning of relation (6.3b) is given by the fact that 
I| Ze | = 1. 
Moreover, from (6.2b) and directly from (1.8) it is easily seen that 


1 
Ly = 5 [e + (47)? + (2)? + (2)*] > 0 (6.3c) 


It may be added that 
ІХ = (t £)? —1 (6.3d) 


Let V be a unimodular spinor transformation group (i.e. linear complex transfor- 
mations in a 2-dimensional complex space with determinant + 1), while dk „ is a group 
of Lorentz transformations preserving the direction of time (with „С determinant + 1) 
defined in a 4-dimensional Minkowski space. 

It is easily seen that t, is related to Lg similarly as V to SER 

Indeed, it is well known that V is homomorphic with £,. The matrices дв and 
— 08 are the ”Normalteiler” of V; the corresponding factor group is isomorphic 
with .£,. Similarly here v, is homomorphic with L whereby the homomorphism 
is realized by means of (6.2b). The matrices ôg and — ôg yield the "Normalteiler" 
of t, while the corresponding factor group is isomorphic with „з. (To any transfor- 
mation of £, there exist exactly two independent transformations of т, differing 
only in sign.) 

The numbers t, subject to the condition [t| = 1 play (with respect to £,) in 
a certain sense the röle of the Cayley-Klein coefficients. 

As can be seen from $4, the homogeneous linear unitary transformation of the 
operators Ё, is determined by the unitary operator eia*X, whereby the transformation 
is restricted by t^ > —2. 

For our purposes it will be convenient to find the explicit expressions for Ly 
corresponding to the transformation connected with ei«"X and expressed by means 
of the quantities ар. = 

This may be attained іп two ways: (1) by introducing the expressions (4.3b) 
for t? into (6.2b) and computing the corresponding products of the contracted mat- 
rices 6 by means of (1.10). i 
2) By putting X,(A) = eña Xs X peia Xs 
we get df 

0 — — анна Х"(А) 
by differentiating and using (1.12a). Integrating the last integral equations with the 
initial condition X,(0) = X, and putting 4 = lin the intregrals we find the expression 
in question. i 
In consequence of either (1) or (2), we gẹt 


/ Xp = ее%Х ече = LX, (6.4a) 
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= (6% — e"e,) cos 20 + Epse e sin 20 + e"e, (6.4b) 
df 
where 
= (aya ^ (6.4) ер = ao. (6.44) 
df df 
The above formulae are also valid in the case о = 0 (of course, after a suitable limiting 
process), which yields then Гу in the form 


L; = дь + 2ара" + 2 „а (6.4е) 


From (6.4b) it can easily be found by computation that the matrices OL? related 
to the particular unitary operators R, = eia? X« (the bracket denotes that the sum- 


df 
mation convention should be abandoned) are of the form 
Ik: 0, 0 ch 20,0, — sh Za” 
WL? = | 0, cos 2a?, + sin 2a? | (6.5a) SIR. T 0 | (6.5b) 
0, — sin 2a, cos 2a? sh 2a 0, ch 2a. 
ch 2а?, + sh 2a?, 0 
Оу =| + sh 202, ch 2а?, 0 (6.5c) 
0, 0, il 


The above special unitary operators R, are connected respectively: Rg with a real 
rotation in the "plane" Ху, Xa; R,, В. with imaginary rotations in the "planes" 
Ху, X4 and Ху, X, 

Formulae (6.2a) and(6.2b) make possible a discussion of the transformation prop- 
erties of the homogenous quadratic forms of the operators £, (i.e. without the linear 
terms) with respect to the transformations belonging to t,. Namely, an arbitrary 
form may be written as b*X,. If U is a unitary operator from (6.1) then, according 
to (6.2), we have 


Ub*X,U = bX, (6.62), where 5% = Ш, (6.6b) 
df 
where Гр is one of the transformation matrices of £, given by (6. 23). 
a: to Snelaln = Em d it is к seen that 
The above relation means that the type of definitness of the homogeneous quad- 
ratic form of the operators &, cannot be changed by means of a linear unitary trans- 
formation. In this connection we may introduce the following terminology: 
We shall call the form b*X, elliptic, parabolic or hyperbolic according as b,b*> 0, 
b,b* = 0, b,b* < 0 respectively. 


We add the following remark: if we take U in (6.6a) in the form U = eia*X, then, 
iking a account of (6.4a, b) we get 


eia* X, b X, e-iakX, = bX,, (6.7a) 
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where 
b, = (b, —& (ej) cos 2w — £p eb sin 2w + e, (ble), (6.7b) 


where c and ер are given by (6.4c), (6.4d). Of course b, (from (6.7b)) satisfies (6.6c) 
which may be verified by a direct computation. 


$7. Reduction of quadratic forms of the operators &, to the canonical form 


In this section we shall briefly discuss the problem of reduction (by means of 
linear unitary transformations) of the quadratic forms in the operators &, to three 
possible canonical forms according to the type of definitness of the form. 


o 
Let us consider the form T= a*X, + £^£, and investigate first the case afa, 72 0. 
As is already known from $4 (4.6c), we may eliminate from T the linear terms by 
means of the unitary transformation 


eit, [ак X, + £ Е] era a* X, — T, 


where у“ is given by (4.5b), while T is given by (4.5d). Thus, it sufficés to discuss the 
question of reduction of a homogeneous form. 

Let us consider first the elliptic case a? a, > 0. Lt А = ға? X,, where e = +1, 
a? > 0, аа, > 0. According to (6.5a) it may easily be seen that by a suitable rotation 
in the "plane" X, and X, the term with X, can be eliminated from A, so that A 
will be a linear combination of X, and X,. With the aid of another imaginary rota- 
tion in the plane X, and X, according to (6.5c) we can eliminate also Xi. 

Or, by means of a simple computation, it can be shown that by assuming 


eB es e? Xi. ei? Xo: (7.1a) 


df 
where 


b? = — 13 arc tg (а?/ал), b? = + ih arth {(а2 + a2) Ja?) (7.1b) 
4 2 q. 2 
we get = = 
elB.e al X,e% = (а a*)* X, 

From (7.1b) it is obvious why such a reduction is impossible (7:1c) for a, a* < 0. 
In this case b? is imaginary so that e is not a unitary operator. 

In the hyperbolic case we have a similar situation as in the elliptic case. By means 
of suitable unitary transformations, 4 can be reduced to two alternative canonical 
forms: either to X, or to X, (with suitable constant factors). 

In more detail, the corresponding unitary transformations may be constructed 


as follows: if а? = 0, then taking b? = —1/2 arth a*/a! we have 
gi Xak Хе", = sgn a'(—a,a*)*X, (7.2a) 
If а? 5 0 and |a! | >|a®| then taking eB = «Хо. е"Х+, where b? = — 1/5 arc 


cos { | a? |(—a, а®)-% 5,503 = °! arth аах, we have 
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еВайХ e? = sgn а?(—а,а\)*Х, (7.25) 
Similarly, if a2 40 and | a | < |a? | then, taking e = е? Xi. gî X: where b1 = 
df 
0 
= к= ia arcth (a9? — a1?) 4/a?}, b? = — 1/2 areth a/a? 
а 
we have 
eiBa* X eB = sgn a*(—a,a")* Xo. (7.2c) 
Since from (6.52), we have 
e 4X DP Ao lig = Х;, et 9X, Doe = X, (7.3c) 


hence, combining exp (+i2/4X,) with a suitable e from formulae (7.2) it is always 
possible to give explicitly the unitary operators transforming the hyperbolic quadratic 
form into any of their two canonical alternatives. 

We could also perform the reduction of elliptic and hyperbolic forms by using 
the general formulae (6.7) instead of performing succesively the simple particular 
unitary transformations. However, this yields more complicated formulae. 

Let us consider now the parabolic case, i.e. the form 


T = ea*X, + ££, e=+1, а > 0, аа, — 0. 


Taking advantage of the definitions of X, and assuming 


y = (ag + ал) 5, y? = — sgn а, (а — ay)” (7.4a) 
df df 

then, as is easily seen, the parabolic form T may be written as 

T =, e[y^£,]? + £^£,. (7.4Ъ) 
Let us investigate first the case ФЕ 55 0. If 
id = (tt) GE, — Тр) (7.5a) 
then * 

| лр. Wie = xS Pts =. (7.5b) 


Thus, if the numbers 4^, 1° satisfy 


Gait em Uo raria Г) 
then #8 (from (7.5a)) may be used as coefficients in the homogeneous linear trans- 


formation é, = U£,U* = t PEs. Consequently our parabolic form T (in agreement 
with (7.5c)) will become 


T = UTU* =), ely? EJ + vet, (7.7) 
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If, in particular, we assume 


X = —€ sgn (р) ° | yet, % 8, n = в | v d % а (7.8а) 
so that (7.6) is assumed EUR then, according to (7.7) we get 
T=UTU*=e| са 6 E RN (7.8b) 
Similarly, assuming 
X EDN (y* £) | v* & |? 02, g^ = «| ye 22 ks of (7.9a) 


which also automatically satisfy (7.6) according to (7.7), we get 


T = UTU* = c|y* Ê, |* Of, é2 + 6) (7.9b) 


Thus, we see that the parabolic quadratic form in the operators £,, (7.4b) can be 

reduced to the two alternative canonical forms (7.8b), (7.8c) by unitary transfor- 
mations provided y* £,7 0. 

If on the other hand y^£, — 0 so that £^— Ay^ then T from (7.4b) may be written 

as | 

T = у, e [y* E, + e А (7.10) 

Let. e.g., y! 40; T from (7. 10а) will go over into 1/5 e [y^£,]? by the displace- 


ment transformation ras T F = &, — 4/41. Obviously the homogeneous 
linear unitary transformations may be chosen so that the last expression will go over 
either into 1], e & or 1/2 22, 

Obviously all the unitary operators (which come into account when reducing 
the parabolic form) are fully determined! by the numerical values of the coefficients 
of the linear transformation. E.g., we can build up the representations of those opera- 
tors explicitly according to the considerations in § 3. 


8. § The ambiguity in chosing w from the equation созо =}; tẹ 


According to formulae (5.4), in order to be able to give for a given 
transformation 


fate pact = 1° 2 (8.1a) 
an operator T with the property | 
вен, (8.1b) 
and assuming for T: 
T=-—5=— (1 X, ha (tj ۴ 4-05] `(8.1е) 
sin 0 sin w 


^ Up to phase factors. Í 
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we have to substitute for w in (8.1с) a solution of equation (8.14): 
cos = lt? (8.1d) 


which is not unique. Now, we possess convenient means for discussing this question 
in more detail This discussion will also enable us to give the eigenvalues 
of the operator as an arbitrary elliptic quadratic form in the operators É,. 

Let us investigate first the case | 1/, £^ | <1, where all roots of (8.1d) are real num- 
bers different from zero. If c, is a positive root of (8.14) lying in (0, л) i.e. œ m 


+ 


= Arc cos Ig tq then the remaining roots may be given by «c, = оо + 27, 4> 


= = — 0, +2rın_, where „„,n_ are integers. Thus, taking the roots from the branch 


On 
w, аз well аз from w_, we always find 
2 
wjsin w Sio с snos ОЕ. (8.2) 
sin 00 
Eu оч (8.3a) 
df sin Wo 


where n, is any fixed integer, while S is given by (8.1c). According to (8.1c) and (8.2) 
it is obvious that every T such that е7 possesses the property (8.1b) may be represent- 
ed as 

2am 


Im = Im + Wy + 2ang 


DE OT (8.3b) 


On the other hand, according to the considerations of 85 we know that two unitary 
operators leading to the same linear unitary transformation may differ from each 
other only by a phase factor. 
It follows, therefore, that 
. 2nm 2] 

е? as али, "o == ет, т = 0, +1, +2,..., (8.4) 
where A, is a number; of course А, = mA (where A is a number) for reasons of con- 
sistency. 

From (8.4) we get as a byproduct the eigenvalues of the operators which is an 
arbitrary elliptic quadratic form of the operators. 
Indeed, denoting 
k _ Og t 27п ka. в оа о + 2an, 
a کد‎ 01, (8.5а = né ke @ я 
PETTAN ta, (8.5а) g PDA б (tg 28.1%), (8.5b) 
we have obvoiusly 


aka, = (wy + 2лто)?. (8.5c) 


Regarding a^ and ga as primary, according to (8.5a, b, c) and (8.3a), we may 
write (8.4) as 


eiaa) “fakx, +848.) _ ей, (8.6) 
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where A is a number. (8.6) holds true if a,a*> 0, (af, £e are treated as primary given 
parameters). 

A is їо be determined now from (8.6). According to (4.5c), multiplying (8.6) 
by e-9" ёа from the left and by е?“ ёа from the right (where y" is given by (4.5b)), we 
find 

gira, aS) Вакх _ „А+ 2л(а;а5) лако ьо) (8.7а) 


Now, operating оп both sides of (8.7а) with a unitary transformation leading to а са- 
nonical form of a*X, (according to (7.1c)), we get 
م‎ Xo -— еї + 22(aga7) 73], aFay p86) (8.7b) 
Thus we see that exp i 2л X, is a number. This number may be determined as 
follows: let | > be an eigenket of the operator £, to the eigenvalue £j while | 0> 
is a ket vector with the property &, | 0» = ле", Since Xy =1, (£2 + £2) = 
= 1/3 (£i — i$o) (£1 + 1&5) + zand" (5, 4-1, 0>=0, we get 


CT so. v3 Ori)” 
«600240 = е V £0 


|2 (8—1) (& + isa 0» =e" |07 


from which it follows that the number exp i 27 X, is equal to exp i x. 
Thus, in consequence of (8.7b) we have 


E [г] —2n(a,a5)—*/ sako apê ê | i (8.7с) 


Now, if we are interested in solving the eigenvalue problem for the elliptic form 


x è d 3 1 —YoT : 
T = aX, Héé a*a, > 0, we notice that (since from (8.6) еї2л(а;0")— 12" — ей) 
we have 
gitana ВТ — ой, (8.8) 


which will be obtained when operating with both sides of the above equalities upon 
a non-trivial eigenket | 7’> of the operator T corresponding to the eigenvalue T”. 
It follows that the eigenvalues 7’ are to be found among the numbers T’ = 


= (a.a) 72 (m Sr =) ‚ where m are integers. 
л 


It is easily seen that for a? > 0 negative values of m cannot occur. Indeed, when 


acting upon T| T' > = (ау (m == ۹ | T2 with the unitary operators that 
È л 
reduce U to the canonical form, we get X,U |T > = tc + 1g) U | Т>. On the 
other hand, on account of the previously used decomposition Ху, we also have 
«T |U*XQU| T7 = «T | Uta (5 + i£) * (Ex +) U| T T >} 


from where it is immediately seen that m > 0 


Г deme иль. DNDERELSSCNSE E ыч 


5 As (21 + 9/9£)e I = О. 
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Finally, the quadratic elliptic form of the operators £,, Т = a*X, + £^£, has 
the following eigenvalues 
T',, = (asa) ial (т + 3/5) — (аза) -* a^a, p E EP. ПО 2 (8:9) 
0 
Let us consider now the question of ambiguity of the solution of (8.1d) in the 
case 1 <#ta%< oo. It may easily be shown that, taking for о any possible solutions 


of equation (8.14) as iw/sin a, we shall have 


о. E (8.108) 
shim) shiwy 
where 
iy = A,cht]s t^ = log (Hata + lat)’ — 1) (8.10b) 
af . 


Since the operator T (constructed according to (8.1d)) should be real, we must 
chose in (8.102) n = 0. It follows that in the case |<!/st,*<oo our formulae yield 
no ambiguity in the definition of the operator e'f for a given unitary transformation. 
Of course, the operator T (constructed for the transformation (8.la) according to 
(8.1c)) will in this case be a hyperbolic quadratic form of the operators &,. 

In the critical case 1/2 t = 1 equation (8.14) leaves no freedom at all for œw: in 
order to build according to (8.1с) а meaningfull T we have to assume w = 0, as only 
in this case is iw/sinw = i imaginary. The operator T constructed in this way (for 
a transformation with the property £^ = 2) is a parabolic quadratic form of &,. 

Summarizing, we see that for £ ^ —— 2 formulae (8.1) makes it possible to construct 
an operator e’ that leads to a given transformation. If t> 2 then the construction of T 
is unique. The ambiguity in T in the case |t,“| < 2 is connected with the discrete 
eigenvalues of the elliptic forms. This ambiguity however, influences only the phase 
factor in е as, according to (8.3) and (8.4), we have eT" = еї" . emâ, where A is 
given by (8.7c). In particular we may assume n, — 0. The quadratic form T E T iEn 
is distinguished from the remaining forms (leading to a given unitary transformation 
with the propery |ta,| < 2) by the fact that the vector a, (determining it) will possess 
а possibly smallest length, i.e. ара“ < л?. 

Formulae (8.1) enable us to construct, in a well defined way, unitary operators 
leading to given transformations with ¢,2> — 2. 

For reasons already discussed in $5 it is not possible to arrive at the transfor- 


mation é, = t P£ p t, if t^ <—2 with the aid of an operator of the form e'f, where Т 
is a quadratic form of the operators £,. However, formulae (8.1) make possible an 
explicit construction of such an operator U that 


E O (8.113) 
where ч 
jp du lies ы ие <= зо, (8.11Ь) 
Indeed, we notice that according to (4.3) 
ЖЕ eh — Е (8.12) 
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Denoting U = e" and t,8 = —t so that X = —t,? > — 2, we get, accord- 
ing to (8.11) and (8.12), 


UE, U* — tf, +t, (8.13) 
When 1% —2, we can solve іп ће aforementioned way the problem of finding 
the explicit form of Vue e'T, where T is a quadratic form of the operators &,. 
Thus, we are able to give the operator U from (8.11a) explicitly in the form 
U = eX. eT, where T is a definite quadratic form in the operators £a. 


Finally, in all cases we are able to construct explicitly the unitary operator U 
leading to a given linear unitary transformation. 


$ 9. Special linear unitary transformations 


In order to be able to take advantage in a computation-operative manner of the 
freedom that is left in several computations by linear unitary transformations it will 
be convenient to dispose of a few particularly simple transformations with character- 
istic properties, to know their representations, and the respective unitary operators, 
the mechanism of their operation upon the state vectors, and to compose the neces- 
sary, more general transformations as products of these few simple ones. 

To this end we shall discuss now in more detail a few special types of linear unitary 
transformations. 

(a) The transformation of rotation and the elliptic operator Ху. The operator 


R, = е“ will be called a rotation operator. Above all let us recall that according 


df 
to (8.b) and (8.7c) we have 
rk el) а у (9.1) 


As is easily seen by specializing (4.9), the operator R, acts upon &, according 
to the formulae 


By GR) ме (ren jn (9.23) 
so that acting with Rọ upon an arbitrary function of the £,, we have 
Ry f (Ex &) Ro = f (cost &, + sint £j, — sint & + cost é], (9.2Ь) 
which justifies the name “rotation operator". 
The operator Ry acts upon X, according to (6.52) as follows 


5 0, 0 
Ry pede SR = 0, cos 27, sin 27 Y (9.3) 
0, — sin 27, cos 2r 
In order to know the effect of operating with Ко upon the state vectors or their rep- 
resentatives wave functions, we have to know the elements Lél Rol 392 By special- 
izing formula (3.4a), on account of (9.2a) we get immediately in the case т = kn 
(k integer) ' № 
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` cs ei^u 
CIE me em pii. & È £j — cost (Er L4 ?y. (9.4) 
But in this way we are unable to find the phase factor in (£i | Ro | Ep: 

In order to give the representatives of Ry with definite phase factors we may 
use another method. Namely, X, = 1/,(2 + £2) (being the simplest elliptic form 
of the operators &,) is nothing S a Hamiltonian of a harmonic oscillator whose ei- 
genvalues are Ху = т + 1/2, т = 0, 1, 2,... (which is a special case of (8.8)). 
Let us denote ds corresponding normalized Ud by | m >. Then we have 


X,|m> = (m +!/,) |m> (9.58) «m [л> = баш. (9.55) 
У |n><m]|=1 (9.56) 
m=0 


Choosing the phase of | m> suitably, the representatives < £,| are given in terms 
of the eigenbras of the operators &,, as follows® i 

a | m5 = U, (5) em, (ê | т> = U, (ê) et (9.62) 
where 


U, (2) = a (2m) - e-"hH, (2). (9.60) 
df 


H,(z) being the m-th Hermite polynomial. 
Since in consequence of (9.5) 


б=ггх ох, У | т» <т | = > | m» eitm+ 0 <m | (9.7) 
т=0 m=0 
and denoting 
Kz, 2, w] = И, (2) U,, (w) ее + 9.8 
4 2 (9.8) 


then, taking account of (9.7) and (9.6a, b), we easily find 
pen | a» = К(т, &, AN . 1 eae £5» ES K Ема ED £j] e” 4 (9.9b) 
lee) = K [r, & £5] (9-90) (55 | e| &1> = K [r + m/s Es £1] -е-#|, (9.94) 


The expression for K may be obtained in the following way: substituting in formula 


(9.8) 


U, (2) = ж (2 т) ет" ie dpe Е 


— «о 


8 The phases in (9.6a) have been so chosen for symmetry; the consistency of (9.6a) with (9.6b) 
follows from the well-known formula 


+00 
(2л)-% f dbe-iba U, (5) = i™ U, (a). 
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we get, after reversing the order of summation and integration, 


+~ 


Rel zs zo] =з ad [ese Barats tela oh Su [V2(z — if) e-*']" U,(w)/Vm! 
«ns m=0 
On the other hand, from the property of the function generating the Hermite poly- 
nomials, it is readily seen that 


ii Е 
| +۷3 но] У ато, 0) 
т=0 


Making use of this result in the preceding equation, we find 


К [v, z, ш] = exp (— 1/2 — z?/, — wl — 22е-й + 2 zwe-*) x 
ite ; З 
aa f dfe _8%(1—e 7 27) + 2ißlze И — we 27) 
л — 
The evaluation of the remaining integrals in the two cases т = Кл, т 54 Ёл, 
where K is an integer, leads to the result 


К [kz, z, w] = 17 ô [z —(—1)* о], (9.10a 
ти” 


V2 z sin а 


Kika + а, z, и] = exp ija [( + w?) ctg x — 2 (— 1)*zw cos а], 


(9.10b) 
where > х 
m yu, k =0, £1, +2,... 

The relations (9.9) and (9.10) solve completely the question of the matrix ele- 
ments <| Юз >, whereby the cases of such values as v w Ro = еї for which 
these elements become infinite are automatically taken into account. 

The knowledge of < е-"*| £; > allows us to find the effect on the state vector 
of the operator е-", as we then know its effect on the representatives of this state 
vector, i.e. on the wave function. Denoting an arbitrary state by | v 2, we put 


Q(z) = «& | e^ | y >, P(r, ê) о <8 le | p>. 
a | 


| Knowing the representations (£j | V? and | £; | v» of О and P, we find 


E T +. `` ^A as 
Qerê) = Г |е DE E 1v» = SKE 1e E> d с», 
[> M is ^ 2, ы `` uu 
P(r ê) = [c&te aec = Је аа CEL yD. 
= is (9.115) 


Obviously the mechanism of the effect of её" on | y > is here equivalent to that 
of introducing the dependence of the initial state on time in the case of a harmonic 
E 


oscillator. _ 
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Comparing (9.11a) and (9.11Ъ) with (9.9) we see that 
Q(r, &) = P (v — a2, £3) - e* (9.12a) 
P(r, &) = Q (r + n[2, £) е (9.12b) 
These relations lead to an interesting remark concerning the notion of the standard 


ket introduced by Dirac in the third edition of his "Quantum Mechanics.” Dirac 
writes down the eigenkets of real operators é, in the form 


Lz lên ED [82 = 8 у> 
and similarly the bras s i 
<ë; | «6С: < 63| = (9 (Ea — Ea), 


where we have added the indices é} and £, to make it quite clear to which operators 
the respective kets and bras correspond. 
In this notation (9.12) becomes 


ro (& — 51) e^ |y» A. ух «шй ДОГ 
If, Е with the ЕЕ of symbolic calculus, we ponerse the law 
ET f (a) eT = f ] a e+] 
from analytic functions to the ó-function then, due to 
o eni 0.5): 
and the arbitrariness of |> in (9.13a), we get from (9.13a) 
<, 5) CA < ех, § (E, — E) (9.13Ъ) 


This relation is consistent with the whole of Dirac’s operational calculus in the 
notation presented by him in § 20 of his treatise, if we put 


<< me GOS < e ost (9.14a) < = gl < e-2 (9.14b) 
[et Ce ts '& 

> = е 12%, > e-t (9.14) >= ех >.“ (9.144) 
1 a [^ & 


((9.14c) is а consequence of (9.14а) as (>)* = <). 
The relations (9.14) yield quite definite forms of the unitary operators for passing 


from > to > and vice versa and may be useful in a symbolic calculus based оп a gene- 
[41 [e i 
ralized Dirac notation. 


In particular, if (9.14) is assumed a priori, then (9.6b) follows quite easily from 
(9.6a) as i 


<> Е. (£j — &)|m> = фе”?®, е0, 85 (£a — E) 2%, „—рузх, jns 
d Ča 4 


= ет) она, § (Е — Е) шу = م‎ € (£i — &) 
a 


m). 
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(b). The dilatation and the hyperbolic operator X,. 


We shall call the unitary operator К, = е the “dilatation operator" and the 
df 
corresponding unitary transformations “dilatations”. 


As aspecial case of equations (4.3) we find without trouble 
. О 
INR ЕЕ” б A) (9.158) 
af df ўе 
so that if f denotes an arbitrary function of the operators £,, we have 
Rif (з SN Rou f (E16, сло ЪЪ (9.15c) 
According to (6.5c) the effect of А, on X, is given by 


ch 2r, sh 2r, 0 
Eu no elu pe SR 27/ch2r,01. (9.16) 
af И uh 


Especially simple and interesting is the action of R, оп an eigenbra of the operators 


9 
E, As X, =o (8162 + 251) and [61, £j] = 1. We get X4 = — i ê, SH — i|, and 


ANE : д д А 
hence «сб | eet &, enr JE, =e" - expt Ede ФИ. == ty om 


—= et = (Өз =. els € +т) | = ет <6(&, — е1) |. 
5 & 
A similar reasoning applies to <,|. The results may be put in the self-expla- 


natory form 
«E le = ей" < Ee" |, eem = ect рет" (9.17а) 


In contradistinction to the case of the rotation operator, these results allow us to 
find the representatives of the operator R, = ež: without knowledge of the eigen- 
functions of the operator X,. Indeed, making use of (9.17a) and the relations 


hom 


LELED = da — £2), «165 = 605 — ê, <> те 


we find immediately 


CE RID = ей" (£e — ё), (9.182) 
AT ` эу т 

е8 = vez eoi (aac) (9.18b) 

RED = e 6 (Ee 8), (9.180) 


MT RS 
<fa| Rel > = E exp(— 1838 67°). (9.184) 
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Let us now take into consideration an arbitrary state represented by Q(£,) = 


= <E ly), P (£j) = <>. Thanks to (9.17), we get immediately the following 
df df i ime 
representatives of the |y> — state, where y> = ers p>, 
df 
QE) = «&|v» = О [£e] e (9.192) 
df 
P(E) = <& >» = P [Eze] e (9.19b) 
df 


It may be observed that for the validity of formula (9.15c) it is not necessary that v 
be a real number. If the operator е (т complex) could be considered as a proper 
operator, than formula (9.15c) would be a direct consequence of formula (4.1) in 
which the operators a and b may be real or complex. It seems, however, that the opera- 
tor е‘ (т complex) must be handled with great care. By formal generalization of the 
preceding conclusions we would arrive at a ö-function of a complex argument. It 
appears, however, that one may speak reasonably of the effect of the operator 
е"Хз (т complex) on states whose representatives are integral functions of the £j 
and £j variables. By considering equations (9.19) as defining the action of the &/*** 
-operator on |y>, we remain in harmony with the theory of this operator for real т. 
It is easily seen that with respect to such states the real operator ©?» would behave 
like multiplication by a real number. 

Here some remarks may be added concerning the simplest hyperbolic operator 
X, = 1, (£i + £55,) which is intimately connected with the dilatation operator 
R, = е, In the eigenproblem X, |X, > = X5|X5- there appears a continouos spec- 
trum running from — oo to + oo. This may be inferred by the following reasoning, 
giving at the same time expressions for the representatives ¢&,|X,>. As X, = — 


X3» 


д re mn с с 
= 11 gg, te we have (—i пок 18 EX = X44 


By integrating this equation we find ¢£,|X,> to within an unknown factor, and 
conclude that the integral of the square of the modulus of < £1|X55 diverges for all X}. 
Thus < &|Х.» has to be normalized to а ô — function, i. e. e X3» = à (X, — X3) 
and thus we get finally 


7 


qm e^ — GXi— И) log |а 
calx» = гүл € , (9.20 a) 


А 


where e’* is a constant phase factor. 


Quite similarly we can deduce the formula 
ей 


СХ — үл 


ei Yo log El , (9.20b) 


where e™ is a phase factor which is already determined by ей. We get the relation be- 
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со 
ee 
tween е and e”, namely е“ = 2 ys je log * cos x? by noting that 
+оо i 
AU | ar 
(a X»- x | e dE, CE, X55. 
— oo 


The easiest way to fulfil this relation is to put 


A = — u = 1,1052 Vs se COS %2. (9.200) 
л 
0 


(с). The imaginary rotation transformation and the operator Ху. 
The hyperbolic operator x, = !/4 (£2 — £1) and the connected transformation 
of imaginary rotation induced by the unitary operator К» = е"Х! are not so interesting 


df 
as the operators and transformations dealt with under (a) and (b). However, for the 
sake of completeness, we shall discuss briefly some of their fundamental properties. 
According to (4.3), it is easily seen that 


hr, sht 
ee cum 9.21 
Man is df es p 0.212) 
We also have, due to (6.5b), 
ch2r, 0, —sh 21 
RXR, = LUX їй = 13 0 (9.21b) 
d i— sh2v, 0, ch2x 


Concerning the action of R, on the state vectors, it may be observed that, 


because of (7.3) 


gi „© ¢ Xal4 : gi е лХ./4 (9.21c) 


. this action is a superposition of the already discussed actions of rotation and dila- 
tation operators. 

Operator X, has a continuous spectrum of eig 
оо; it is seen without difficulty that if хх, > = bebes then, according to 
(7.3): X, PIX > = X, АХ >. Hence if |Xx;» = |20 — Xj) >x, is an 
eigenstate of the operator with representatives given by formulae (9.20), phen |Ху;> 
is connected with it by the relation |х; ste PUN OSX ae e where 

e is a phase factor’. i 

Knowing <&|X,> and the representatives of the rotation operator, we can find 


the representatives of |X,> by simple integrations. 


envalues extending from — oo to 


? Therefore the following rule for the standard kets connected with x, and x, may be deduced: 


р 6 AD 


аа ааа: 
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(d). Parabolic transformations and operators 


The general parabolic operator T with e = + 1 and p*considered as primitive 


can be written in the following forms 


о iL е о 
T = 1$ e[y*6,]5. == 62 oa = E a% хов + EE, = a*X, + E*E,, (9.22) 
df 2 
where 
а = eyy, ak cok wp! (9.22 b) 
df 


Obviously aa, = 0. Taking а special case of T from (9.22а) and making use of 
(4.3), we find 
aTe e РР ДЕТ. (9.23a) 


where 
8 1 А. 
taf E д — EPY, ig = (S елар? E х) ёв (9.23Ъ) 


Ifa “parabolic” transformation is given beforehand for which &, = ЕВ Eg tty te =2 
then, according to (5.5), the unitary operator eT leads to the above transformation 
when i 
T = Ve [Vo (6, + 6468) — (tft? + 1) £s]. (9.24) 
Аз we already know, the unitary operator e'f with T given by (9.222) transforms 
X, thanks to (6.4e), as follows 
Е, (9.25а) 
Lt оа E ча, (9.25Ъ) 
4 | 
. where a, is given by (9.22c) if e and v, are considered as primitive. Equation (9.25 с) 
can easily be verified by introducing t? from (9.23b) into (6.2b) and making use of 
(1.10Ъ). It is obvious that, due to a*a, = 0, L% from (9.25 b) satisfies for arbitrary a, 
the relation ГЕ = 3 and this is consistent with (6.34) as t, = 2. 
Concerning the eigenfunction and eigenvalues of the parabolic form (9.22а), 
it may be observed that according to (7.8b) when ре, yé 0 а unitary operator U may 


be explicitly constructed (as well as its representatives by means of formulae given 


in $ 3) for which UTU* = ере | {1/„ 42 + £j). It suffices therefore to consider 
the eigenvalue problem for the canonical form of the parabolic form: P = 10,22 + £s. 


Passing to the representation of the well-known equation P/P'» E PP’) and 
remembering that & = — 19/9 &, we find (1,2 — i 9/9 E< £P» = PX EP >. 
This is a differential equation with separated variables; by solving it, we find without 
trouble that P possesses a continuous spectrum extending from — oo to + oo and that 
its representatives (£1/P'» normalized to а ô function read 


TER саа 
«4| Р » = Vor E et (Ps un ), | (9.262) 
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А 


where e^ is an arbitrary phase factor. Hence we get 


T 
EM 1 ae : PT 
CGP» = VR DS GEE -^Vi Фү? (=. —Р)] (9.26Ъ) 
290 л 
where 


~ 


1 
Ф (X) zy; sme 3 +x X) 
0 


is Airy’s function expressible in terms of Bessel functions® 


(e). The displacement transformation 


Several times we have availed ourselves of the unitary displacement transfor- 
mation, in this section we shall briefly sum up its properties and indicate some of its 
hitherto unmentioned characteristic features. 

According to (4.5), we have 


oie رغ‎ eta = E+ EP, (9.272) 
where 3 are arbitrary numbers, so that for an arbitrary function of the é, -operators 
es (Е, Ee) ече = f( + Ër ES + E). ` (9.27)b 


Taking (9.27b) into account, we shall call the operator Т = е-%%а the “displace- 
df 
ment operator". 


We shall consider now the following problem: the composition of two displace- 
ment transformations leads —independently of their order of suecession — to the displa- 
cement transformation which is the algebraic sum of the initial displacements. Accord- 
ing to the considerations concerning uniqueness of the definition of a unitary operator 
belonging to a given transformation, the product of two displacement operators in 
a given order of successsion can differ from the displacement operator belonging to 
the algebraic sum of the two initial displacements only by a definite phase factor. 
We shall now determine this factor. 

In accordance with the above facts, the following relations prevail 

(1) (2) (1) (2) 
e а. ea є e? - gi MIS (9.282) 


жой en OE ae ote О чыз Oe. re odis 
8 P(x) == Vas ИЙ (% ach) p 1% (% x) for X» 0 
апа 


P(x) = Vee {J «04 |519 + Jy G5 b») lor x (0 
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(2) (1) (1) (2) 
ойе Pe о CE , (9.28 b) 


aq) Q е5 | 
where &°, £, are arbitrary numbers, and A and u are the quantities to be found. 


Thanks to (9.27b), we have 
(1) (2) (1) (2) (1) (1) (2) а) (1) 


mo cC еа 
(2)(1) (2) (1) 
M e tea : e Ia Р ета 
hence, comparing (9.28а) with (9.281), we see that 
(1)(2) 
мес (9.28 c) 


inXo 


By taking the complex conjugate of (9.28a) and multiplying the result by e'"^* from 


the left and by е from the right, we get (noting also that under this transforma- 
tion £,— — £j 


(2) (1) (1) (2) 
mcm а ى‎ о ЕК) (9.28 а) 


Оп comparing (9.284) with (9.28Ъ), we find 


ей = еті (9.28е) 
and from (9.28с) апа (9. 28d) we get | 
(1)(2) (2)(1) 
RENT eve сш (9.281) 


Thus, we can finally write the general formula for the composition of displacement 
operators, as follows?: 


(1) (2) (1)(2) (1) (2) 
Еа ‚ва, 4 а га а 
eB a. е-#%а — gila . e i6 46063 (9.29) 


Special cases of formula (9.29) are 


(0) (0) (0)(0) (0) 
еб. е-й ER, o-a (9.30 а) 
(0) (0) (0)(0) (0) 
ей: org ê1 ے‎ ей? . o 2m (9.30Ъ) 
Жуз... (1) (2) Ў 
фе Putting а = — Ёа £ and b = — Ёа Ё, formula (9.4) сап be brought, on account of [a, 5] = 
2) Р 


— 144. into the form э 
cia eib = e-Yladl . ейа+ D), 


It is obvious that this last equation holds for arbitrary pairs of real operators a, b whose commutator is 


a number. A direct proof based on series multiplication and commutation rules seems to be rather 
troublesome, 
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These formulae allow us to give at once the results of the action of the operator 


o 
adi З * 
л = еа on eigenbras of the £,-operators. Namely, on account of £j = — i 9/9 &, 
and (9.30a), we have for instance 


2 д 
awe M. oU ` ° D COS . 0170 оК Moo 03 
Pe E | еа) — ей? — E BT E. ое ag 
оо о о д Еб 3 А 
ES, TED . COR E (EÊ, FES ` M 
=== (2 1035215,6 1). е 04 < &| = gi ASE +E 1) Z & E 


In the last term of the above formula < £j + A | evidently means the eigenbra 
of the operator corresponding to the eigenvalue & + [5c Quite analogously to 
(9.30b), we can get a similar relation for — £j. 

These relations may be condensed into the formulae 


cnet t aed ue сы (9.314) 


Ое" A e (CASE RED) (8 Е 53 (9.31b) 


UN E 
` EI ` as ` К x А <“ ` a je» 
ASG E oO Cb. (< 0)5 45) <| > Vor e! , the 
above relations lead immediately to the following representatives of the operator 
T = йа 


< & |7| & »» = iE + ÊÊ) . (5, + A a &) , (9.323) 
оо с Ш А a 
À Nr сое 
cama» =е wA (9.32) 
оо o 1 E, o 
: ССО Mee ee SS SEG ene 
<®|П|ё >. = е et (9.32c) 
< 2,1 | 3 > = mU + Ê) (5, + &,— Э) (9.324) 


From (9.31) the mechanism of the action of the е-&а -operator on wave 
functions can also be found without difficulty. Let |v > represent an arbitrary state 
with Q(£) = «& | >, P(E) = < éz | v» as representatives. We build now the state 

df df 


| y> = е-%а | y >. In accordance with (9.6), its representatives will be given by the 
df 


formulae 


Q(&) = Elp y =e +4 OG + ÊD, (9.33) 
df 

P(E) =< &lv» — ehh +8 p (Е Ej). (9.335) 
df 


The IT = eta -operator displaces, so to say, the center of the wave function 


in phase space by E. and endows it with a definite phase factor linearly dependent on £j 
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$ 10. The composition of the operator of homogeneous transformation with the 
displacement operator | 


In the preceding section we dealt with some specially simple types of unitary linear 
transformations and their corresponding unitary operators. The last question which 
we intend to discuss now concerns the composition of unitary linear transforma- 
tions and of unitary operators connected with them. This question is of importance 
since the simple transformations of § 9 allow us to calculate any given sought for trans- 
formation by decomposing it into special transformations; we did it, e. g., in $7 
while reducing quadratic forms of the £,-operators to canonical form. It is then 
very convenient indeed to have explicitly at our disposal a unitary operator corres- 
ponding to the composition of two simple transformations. 

In general the work of §§ 5 and 6 allow us to solve the above problem 


to a certain degree. Transforming & 
ei 


a successively by a series of simple unitary 


operators known from § 9: ei^ , we finally get 


еп. ее ее Бе In — qus +t, 


where tf and t, are given numbers. Treating é, = t é, + taas an a priori given 
transformation according to §§ 5, and 8, we can explicitly construct a unitary opera- 
tor eT leading to this transformation; T is a quadratic form in &, (if tq > — 2; if 
t? <—2 then we must take е - eT instead of eT). It is clear that this unitary opera- 
tor can differ from the operator ef»... e+e! only by a definite phase factor. 

In the present investigation we shall confine ourselves to the determination of 
this phase factor only for the composition of the general homogeneous transfor- 
mation operator with the displacement operator ein 


According to (4.6), if œ? = a, a? 0, then 


TN C 23 ‘ak ; | 
PIC е EQ — eiT. oiya . сі "Хр. орча (10.1a) 


о о 
where y" and Т are expressed in terms of а, and &, (formulae (4.51) and (45а). 
We have also 
е^Хь ela = RX e-r $a ei^ X р. ei? X, = PEL : ci^ X 
9 
where t,? is to be considered as expressed in terms of a, by means d formula (4.3b) 
It follows that (10.12) may be written in the form 


T ei X44 а) = ета . е dts Д X (10. 1b) 


Applying now the law of а of the displacement operators given in 
(9.29) to the product е? #а. еа 8, we may put relation (10.1b) into the form 


tr. gi (atx, +8% a) — gif ye Вур. gie Xy (10.1‹) 


We now define 7° = y" — у? tg’, substitute herein for y? expression (4.5b) and 
df 


for t, expression (4.3b) and solve for &°, We then substitute for Ё in (10.1c) &° ax 
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pressed in terms of а^ and у" and finally formula (10.1с) becomes 


cinta . oiakxX, _ ый скай), (10.2a) 
where 

E, = o], ctg wla Na — она (10.2b) 

df 
А = p(w) opas a*n"? (10.2) 

df 

p(w) = (1 — sin w/w) (2 sin w/a) ? (10.24) 

df 
w = (a, а)? (10.2e) 


Formulae (10.2) solve the problem of finding the law of composition of displace- 
ment operators with unitary operators of homogeneous transformations. 

Formulae (10.2) have been deduced under the assumption that о 4 0. It is 
easily seen, however, that the limiting process w > 0 leads to a definite finite result 
(lim p(w) = tẹ lim 0/5 ctg w/a = 1), so that these formulae are also valid for о = 0. 


®«—>0 w—>0 A M 

In order to find analogous formulae for er. ga , one must simply take the 
conjugate complex expression of (10.2) and substitute afterwards a* — — af, 

5^ — — 7". This yields 
die X. pêa — uA UG x, + tto), - (10.3a) 

where 
aum wla ctg wle 1° + gaa a^ n^ (10.3 b) 
df 

A = ф(о) Rap 1" nê а^ (10.3 с) 

df 


and p(w) and w are again defined by (10.24) and (10.26) respectively. 

Obviously, formulae (10.3) remain valid in the limiting case о > 0; in the special 
case where all a*— 0, (10.2a) and (10.3a) reduce to trivial identities. 

Formulae (10.2) and (10.3) evidently become meaningless for o = nz; п = + А 
+ 2, ...; in particular they cannot be used in order to put together-into one unitary 
operator the operator of change of sign (g e") and the displacement operator. 

Formulae (10.2) and (10.3) solved for ne in terms of ¿° yield definite formulae for 
the decomposition of the general operators of a linear transformation into the opera- 
tor of the homogeneous transformation and the displacement operator. 


811. Concluding remarks 


It follows from the considerations of the preceding sections that unitary linear 
transformations are closely connected with quadratic forms of the momentum and 
position operators, i. e. of the real operators £,. These considerations form a rather 
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autonomic whole; it has been possible to solve by the method exposed in the present 
paper a great many problems concerning quadratic forms of &, -operators and our 
formulae give adequate means of solving all concrete problems in the theory of qua- 
dratic forms of the £, -operators and of unitary linear transformations. 

In particular it is obviously possible to find by means of these formulae not only 
the eigenvalues of any general quadratic form of the &, -operators but also its normal- 
ized eigenfunctions in explicit form. Namely, in § 9 normalized eigenfunctions were 
found for all canonical quadratic forms of the £, -operators; since also representa- 
tives of all simple unitary operators needed for reducing the general quadratic form 
of the &, -operators have been given there, the calculations of the eigenfunctions 
of this quadratic form reduce to the performance of some definite quadratures. 
(Moreover, all the integrals appearing here are explicitly calulable by means of ele- 
mentary functions — with the sole exception of parabolic forms where Bessel func- 
tions are also needed). 

Our formalism cannot be directly applied to the physically most interesting case 
of the quantum theory of fields, dealing chiefly with non-real operators (besides, the 
quadratic forms appearing in that theory depend as a rule on non-commuting opera- 
tors). 

As already mentioned, a great part of our formulae also remain valid for com- 
plex parameters, but the question arises then if the operators thus obtained are proper 
ones. Therefore a generalization of the formalism set forth in the present paper to 
cover the case of complex &, -operators seems highly needed. We intend to treat 
this problem in a subsequent paper based on Dirac’s (1937) theory of complex re- 
presentations (see also Iwata 1951). 


КРАТКОЕ СОДЕРЖАНИЕ 


Инфельд и Плебанский, О некотором классе унитарных преобра- 
зований. 


Работа посвящена ислледованию линейных унитарных преобразований, ко- 
торым могут подвергаться HeKOMMyTATMBHBIe операторы местоположения и VM- 
пульса. Ввиду того, что эти операторы поводятся при линейных унитарных преоб- 
разованиях аналогично двум компонентам ковариантного спинора при унимодуляр- 
ных преобразованиях, в $ 1 введено индексную запись и развинуто формализм ана- 
логичный спинорному исчислению. В § 2 определено группу линейных унитарных 
преобразований т и группу однородных унитарных преобразований Tn, a B $ 3 mo- 
дано представления унитарных операторов проводящих к заданным преобразова- 
ниям ИЗ tn. В $ 4 проведено дискуссию действия общих унитарных операторов вида 
eiT (T — квадратичная форма операторов местоположения и импульса), которые 
приводят к преобразованию из т, ав § 5 решено вопрос о нахождении эффек- 
тивно оператора eiT, который должен приводить к заданному преобразованию из т. 
В дальнзйшем $ 6 проведено дискуссию группы преобразований. которую инду- 
цирует tn для некоторых трех действительных операторов, являющихся квадра- 
тичными формами операторов местоположения и импульса (физически иден- 
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тичные с TaMMJIBTOHMaHOM, лагранжианом и действием гармонического осцилля- 
тора). Оказывается, эта группа идентична C Ls, т.е. c ,resticted Lorentz group" 
действующей в трехмерном пространстве Минковского. В § 7 проведено дискуссию 
редукции посредством линейных унитарных преобразований квадратичных форм 
операторов местоположения и импульса к трем возможным каноническим видам, 
в $ 8, в связи с вопросом о нахождении еТ приводящего к заданному преобразо- 
ванию из Т, подано простой способ нахождения собственных значений оператора 
являющегося общей эллиптичной квадратичной формой операторов местополо- 
жения и импульса. В 8 9, специализируя общую теорию с предшествующих глав, 
проведено детальную дискуссию ряда специальных унитарных преобразований 
и связанных с ними унитарных операторов, и найдено механизм действия этих 
операторов на векторы состояния. Формулы этого параграфа могут быть, как по- 
лагается полезны при вычислениях выполняемых непосредственно на операторах 
в разных проблемах квантовой механики. В последней главе 10 проведено дис- 
куссию вопроса о сложении линейных унитарных преобразований и связанных 
с ними унитарных операторов. 
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A GENERALIZATION OF THE YOUNG-RUBINOWICZ 
PRINCIPLE IN THE THEORY OF DIFFRACTION 


Ву К. S. INGARDEN, 
Institute of Mathematics of the Polish Academy of Sciences, Section of Geometrical Optics, Wroclaw 
( Received October 13, 1954) 


The paper contains a generalization of Rubinowicz method of decomposition of 
the wave behind a Kirchhoff screen into the "incident" wave иу and the "'diffracted" 
wave Up for the case of an arbitrary "geometrical" wave falling on the screen. The calcu- 
lations are carried out in approximation of Kirchhoff's scalar theory for an arbitrary 
shape of the screen. Fock's approximate solution of the wave equation (1950) is used 
for the incident wave. The result for wp is a line integral containing a series expansion 
in powers of 1/Ё (k — the wave number) as well as of 1/R (R — the distance from the 
integration point to the observation point) and gives Rubinowicz’ formula as’ a special 
case. In the ’’parageometric approximation” a specially simple extension of Rubinowicz' 
result is obtained, being a series whose coefficients define the aberrations of the incident 
wave along the diffracting edge. 


1. Introduction 


In his well-known paper „Die Beugungswelle in der Kirchhoffschen Theorie der 
Beugungserscheinungen* (1917) Rubinowicz performed a separation of the (scalar) 
light wave behind a Kirchhoff screen of arbitrary shape into two components: an 
“incident wave“ иг and a “diffracted wave“ up. The first component can be obtained 
from the wave falling on the screen by subtracting from it those of Из parts which 
lie in the geometrical shadow of the screen. The second component is that part of the 
whole wave behind the screen which is left after subtracting uz. Rubinowicz gave 
a formula for up for the case of the spherical incident wave valid in Kirchhoff’s approx- 
imation. The result has the form of a line integral along the edge of the screen and 
can be interpreted in terms of the principle formulated for the first tiine by Thomas 
Young (1802). Young tried to explain the phenomenon of diffraction as interference 
of the directly incident wave (primary wave) and the spherical “elementary“ waves 
(secondary waves, “wavelets“) originating at the edge points of the screen. Fresnel, 
who first succesfully unified Huygens’ idea of elementary waves with Young’s idea 
of interference (in his work which was awarded a prize by the Paris Academy in 1818, 
see Fresnel 1826, 1866), showed the insufficiency of Young’s formulation. At the same 


(17) 
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time, however, Fresnel showed a way for a better formulation both of Huygens' as 
well as of Young's principle. This way consisted in the introduction of a ,,direction 
factor“ into the expression for an elementary wave by means of which this wave ceases 
to be exactly spherical and is converted into a „directed wave“. Fresnel introduced 
such a factor into the formulation of Huygens’ principle (although not in an explicit 
form) and obtained considerable success in confrontation with experiment. The 
analogous consideration of Young’s principle was performed not by Fresnel himself 
but, almost a century later, by Rubinowicz (1917) who carried out Fresnel’s program 
for a special but very important case. In principle, the correctness of Young’s idea 
for a still more special case (a plane wave falling on a diffracting half-plane) could 
be inferred already from the first exact solution of the diffraction problem by Som- 
merfeld (1896, 1901). In spite of the great importance of Sommerfeld's and Rubinowicz’ 
papers, the problem of the general validity or rather of the general form of Young’s 
principle remained open. The significance of this problem grew recently considerably 
in connection with the development of the s. c. diffraction theory of aberrations of 
optical systems (see the last report on this subject, Wolf 1951). The understanding of 
the réle of diffraction in formation of optical images, growing continuously from the 
times of Abbe, led eventually to the fusion of the geometrical theory of aberrations 
with the theory of diffraction. As clearly formulated by Born about twenty yers ago 
(Born 1932), it is unreasonable and unpractical to develop further these two disciplines 
independently, since they are both theories of idealized limiting cases of only academic 
interest (if not considered as rough approximations). For practical applications it is 
necessary to have a theory of the „mixed“ cases corresponding to real situations. It is 
obvious that for such a „combined“ theory Rubinowicz’ decomposition u = Ugeom + Чаш 
would be of great importance in spite of the singularities and discontinuities of both 
components inherent in this decomposition. 

From the experimental work done till now, many examples are known of consider- 
able discrepancies between the present, too narrow, theory and the results of obser- 
vation (see e. g. Conrady 1926, Nienhuis 1948, 1949). Even qualitative statements of 
the theory are in many cases insufficient or doubtful. The latter concerns also the 
Young-Rubinowicz principle. We may quote from Wolf’s report (1951, р. 116): „Many 
assertions in Nienhuis’ thesis and in other literature give the impression that the 
transformation of the diffraction integral into an integral expressing the edge effect 
and a term expressing the geometrical effect has been carried out in the general case, 
but an examination of the relevant papers reveals that the published proofs relate to 
spherical and plane waves only“. Doubts about the general validity of the Young- 


! For historical exactness it must be mentioned that the transformation of the diffraction integral 
in о a line integral for the case of a spherical incident wave was given first by Maggi (1888). His paper, 
however, published in Italian n an Italian mathematical journal was for many years almost completely 
unknown to physicists and was discovered by Kottler (1923) already after Rubinowicz’ work. The latter 
was not only wholly independent of Maggi’s paper but was also done by a different method and pre- 
sents a much more complete mathematical and physical discussion of the problem. 
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-Rubinowicz principle were expressed by van Kampen (1948, p. 580): ,,Writing u = 
Ugeom + Чан» Maggi and Rubinowicz showed that in the case of the spherical wave- 
-front из can be written as a line-integral along the edge. This of course is possible 
for any special shape of wave-front, provided that it is a given function-of the whole 
aperture. For a justification, however, of the expression ,,diffraction a the edge“ it is 
necessary to show that из depends only on the incident wave in the neighbourhood 
of the edge. That is actually the case for the second kind [critical] points, but the 
higher terms of the first kind [critical] points give rise to another part of the diffrac- 
tion, which cannot be ascribed to the edge“. (l'he meaning of the last sentence of van 
Kampen will be explained in the last section of the present paper.) We see, therefore, 
that the problem is still open and that a priori it is difficult to say whether it may be 
answered positively. The aim of this paper is to clear up this problem. 

Similarly as Rubinowicz, we shall confine ourselves to the approximation of 
Kirchhoff's scalar theory and to thin screens (Kirchhoff's screens). 


2. The general geometrical wave 


By a wave we shall understand a (complex) solution и of the wave equation | 
Au+k*u = 0, (2.1) 


where Ё is a positive constant = 27/A (this means that the optical medium is 
considered as homogeneous and isotropic; / is the wave length), satisfying Sommer- 
feld's radiation condition at infinity 


ди 
1 — س‎ Ў = 0 A 
s Mj ( 5; D (2:2) 


(r= Va? + y? + 2%, where x, у, z are Cartesian orthogonal coordinates). According 
to this terminology we shall use rather the term „wavelet“ than „elementary wave“ 
if we are not certain that (2.1) is satisfied. Writing u, as usually, in the form 


и = Ach, (2.3) 


where the functions A (amplitude) and S (phase or eiconal) are real and independent 
of k, and assuming AZ 0 (what is equivalent to u7 0) we have instead of (2.1) 


12 А? [1 — (grad 5)?] + AAA + ik div (A? grad 5) = 0. (2.4) 


Separating (2.4) into real and imaginary parts, we get 


(a) (grad S)? = 1 + сс (b) div (4? grad 5) = 0. (2.5) 


is limited and Ё is sufficiently large, we have 


AA 
E = 


|AA| < k? |A]. (2.6) 
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Then we can consider approximately A and 5 as restricted by 
(grad о) =l (2.5a’) 
and (2.5b). We shall call the function (2.3) satisfying (2.2), (2.5a’) and (2.6) a geomet- 


rical wave. Not every wave is a geometrical one and, in general, geometrical waves 
are exact waves only in the limit for A > oo. For the sequel, therefore, we assume that 


k is very large: 


k > 1. (2.7) 
It must be stressed, however, that there exist also waves which are geometrical for 
every К. For instance, the plane wave 
еїах by +c2) 


(a, b, c — real constants satisfying the equation a? + b? + c? = 1) and the spherical 
wave . 
ей 
Q 
(о being the distance form the “source” or “focal“ point) are geometrical for every 
value of k in the whole space (in the second case the source point is excluded). 
A geometrical wave is defined. uniquely by its values on some surface & non-tan- 
gential to the rays (the latter given by vectors grad S). The solution of such a boundary 
problem was given recently by Fock in his paper on a generalization of Fresnel’s re- 
flection formulae (1950). This solution is very convenient for our purposes. We recapit- 
ulate, therefore, briefly those of Fock’s results (in slightly modified notation) which 
will be applied in the present paper. 
Let the surface 2 be given by means of the parametric representation 


x, = P, (р), (28) 


where (x,) = (x, y, 2), (и = 1, 2, 3), and p', (i = 1,2), are Gaussian parameters on 
the surface. The line element on 2 is determined by the “first. differential form“ 


99, 99, 


dA lc 
ат? = gj dp! dp, gij Opi Әрі 
и 


(2.9) 
(we apply the Einstein summation convention only to Roman indices). The unit 
vector normal to X at a point p! is given by 


RINI Эф, 39, 
ПМ, Зы ICE > 
Ny (р?) үг 2: E we Ip! Эр?’ (2.10) 
where g = Det (g;) and Eye denotes the completely antisymmetric tensor of Levi- 
Civita in x,— space. We need also the “second differential form“ of the surface Е: 


2 
т as Gi dp! арі, Gi = : ,ال ل‎ (2.11) 
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where R, is the radius of curvature of the curve obtained аз a section of the surface 

: д 

X by the plane through the vector n, and the displacement vector on 27: dx, mr dp’. 
Let the values of и on 27 be given by 


u (p) = a (p) ео. (2-12) 


The direction of the wave propagation through every point of & is given by the unit 


vector 
о, / . до до 
Su(p') = 8" Opi dpi + Mp laor Эр Әр? (2.13) 


(gg; — д; = „Kronecker’s delta^). We choose the positive sense on the coordinate 


lines p! — const. and p? — const. in such a way that for a given s, 


en, $, > 0. (2.14) 


Surfaces of constant phase S = const. („wave surfaces“) are determined by the equa- 
tions 

x, = Pu (p) + le — 0 (2) s, (р) = v, (ро), (2.15) 
where the parameter o takes arbitrary, but constant, values and the p? s vary over 
their intervals of variability. The parameters ( р! ,0) can serve as curvilinear coordi- 
nates in space (we call them wave coordinates). The connection of wave coordinates 
with Cartesian coordinates is determined by (2.15). For given (p',o) the x,’s are 
uniquely determined, but the reverse is not always true. Using wave coordinates, we 
have the following expression for the line element in space 


di? = вар’ dp! + до?, hg = y" Tim Т, (2.16) 
where 
> gio? до " 
й — pi звао 20, 211 
I nas "С pit um u 
T, — 285 — 0,0; + (0—0) (о; + б V1—g"6,0,), (2.18) 
920 1 do 1 1 li дв ыс 2.19 
00 = oropa СТА ДЕ). 8 дрі др! др" Y ( 19) 


Denoting by s = o —о(р') the distance along the ray from Z to the wave surface 
S — o, we obtain for Fock's »spreading factor" (which determines the spreading or 
shrinking of the wave surface as a function of s) 

Det (Ti () р (2.20) 
& ¥ 1 — воо; 
By the help of D Fock’s final formula for u can be written 


ТЕС (2.21) 


D ($) = 
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or more explicitly 


: D(pi; 0) ALIM 

(a) A (pi, о) = «(p') Nii =a (b) 5(рь о) = e. (2.22) 
The function (2.21) satisfies the boundary conditions (2.12) and (2.2) and equations 
(2.5a’) and (2.5b). It is, however, rather difficult to check whether condition (2.6) 
is satisfied. Fock assumes that the function а (р?) is a “slowly varying” function i. e. 
such *that the derivatives of this function in directions tangential to the surface divided 
by k are small in comparison with the values of the function itself” (Fock 1950, p. 974). 
By more exact inspection it is seene, however, that Fock's assumption is, in general, 
insufficient for (2.6) (some assumptions concerning the second derivatives are also 
necessary). Not going into a detailed study of the rather complicated question of 
sufficient conditions for the boundary values (2.12) to guarantee (2.6), we assume 
only that the functions & (р!) and о (p) satisfy for given g,(p') such regularity con- 
ditions that the inequality (2.6) is fulfilled by (2.22a) (provided Ё is sufficiently large) 
at every point of space with the possible exception of the points of the surface 


D(s) — 0, (2.23) 


i.e. of the locus of singular points of (2.21). According to (2.18), (2.19), and (2.20), 
equation (2.23) can be written 
80 | no 3-6 = 0, (2.24) 
where €, 7, © are some functions of p! which may be calculated by means of the for- 
mulae quoted if the functions (р), a (p°), с(р’) are given. From (2.24) we see that 
the singular points of (2.21) form, in general, a surface intersecting every optical ray 
in two points at most. This surface is called a focal or caustic surface of the wave and 
is a generalization of the concept of focal (source) points of spherical waves. 
Fock’s solution (2.21) is a generalization of two simplest solutions of the wave 
equation 
(a) a plane wave u = ae, 
RER: KU = 
(b) a spherical wave и = — ее = —— eike 
0 5-0 
(in both cases @„ = const.; for (а) D = const., for (b) D = const. o? = const, (s+0)?) 
and represents the most general geometrical wave. 


3. The diffracted wave ч 


We shall assume that the incident wave in the diffraction problem is а general 
geometrical wave given by Fock’s solution (2.21). Of course, it does not constitute 
the most general case possible, but the loss of generality is practically without impor- 
tance. The wave obtained by the process of diffraction from a geometrical wave is, 
in general, already о non-geometrical wave, but, as we shall see, its deviation from 
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a non-geometrical wave is mostly very small. After the secondary diffraction of such 
a wave, the *primary" deviations from geometricity will play a very unessential róle 
in the end result and can be neglected in comparison with the *secondary" diffractional 
term. Exceptions occur only when the aperture in the second diffracting screen is not 
wholly filled up by the “direct” light, i. e. when a part of the second aperture is placed 
in the geometrical shadow of the first screen. Then we may proceed inversely and 
neglect the second diffraction in comparison with the first one in regions cut out by the 
shadow boundary of the first screen, since there the secondary diffraction term is of | 
the second order of smallness in comparison with the term resulting from diffraction 
at the first screen. In any case, excluding only some exceptional situations of rather 
small interest in practical optics, we may consider the incident wave practically always 
as a purely geometrical wave. Errors resulting from this assumption will be, in general, 
small as compared with errors connected with Kirchhoff’s approximation. 

We assume that the diffracting edge is an analytical closed? curve without double 
(or multiple) points and that one may put a regular surface Х (2.8) through it. We 
choose the Gaussian parameters p! on this surface in such a way that опе of the para- 
meters, e.g. plz p, Ваза constant value (e.g. equal to 1) along the diffracting edge, 
inside of the aperture 0 <p <1, outside p >1, and that the second parameter, 
p? == Ф, varies along the edge from 0 to 2л. We assume further that on & the func- 
tions а (p, 9) and c (p, p) are given and satisfy all assumptions of Section 2. In this 
way the incident wave is exactly determined. 

After Rubinowicz (1917), we decompose Kirchhoff’s integral 


1 — @ Ге ей Ou 1 
ee кы | am | کے‎ => |Q 3.1 
“= Am D A T 2 df zi df с 
Е Е 


(r is the distance from the observation point to the integration point, df — the surface 


element, Se differentiation along the normal to the surface in the sense of the 
n 


ЫЕ ди Е : Я , 
incident wave, „ and РЕ а given functions on the surface of integration, F — 
n 


part of 27 inside of the aperture) in 


à 


1 1 
Жр + is | Odf.= : 22 
u= Qdf je if = ш + up (3.2) 
F+K K 


where K is the surface of the shadow boundary formed by the rays of the incident 
wave passing trough the edge points and determined by the vectors s, (1,9), 0 <Y «2m, 
see (2.13); ит and up denote the first and the second component respectively. The 


“Se уык на DIU ESI X ee 


2 The assumption of closeness is not necessary for the following considerations, but we choose 
it to fix the ideas and because it corresponds to the prevailing majority of practical cases. For the general 
case only some minor and obvious changes should be made in the following formulae. 
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normal vector to the К —surface points into the region with: 0 <р <1. Taking 
in (3.1) and (3.2) 


u = a(p, p) pa = = E grad (= Ve =) | оп F, (3.3) 
n 5=0 


Е Ра, g ;0) I a ee 34 
RE о Oe $4 


we have (cf. Rubinowicz 1917, p. 260) 
D(p, 9 39) 


a(p, E aT 
ur = (P. 9) D(p, ¢ ; o —o (p. 9)) 
0 for p 1. 


eike for 0 «C p « 1. (3.5) 


Our task consists in evaluation of the integral for up. 

Following Rubinowicz, we introduce orthogonal Gaussian parameters on the 
surface К, namely the edge parameter 9,0 Sp <2л, and the ray parameter (the 
phase) o, с (1,9) = > (ф) 52 <20. We have 


2л со 
1 23 
puces T1882 Ji do О Vw, (3.6) 
0 9,9) 
where 
А0 
Ет 0, (es hoy (9: 0) = ш(ф)о? + > (p)o + Ep); (3.7) 


see (2.16) — (2.19). Using (3.4), (3.7) and (2.24), we get 


2x оо 
1 Е 2 ik(o +r) 
Lose Lr [оон “кек т Cae ein 
0 c = 


From Fig. 1 we obtain, similarly as Rubinowicz (1917), 
r? = R? + (o — о)? + 2R (о — o) cos (В, s), (3.9) 


(mee cos (п, r) = t cos (n, R). (3.10) 


Hence 


up (R, cos (R, 5), cos (В, п)) = 


Fig. 1. Explanation of the notation used 


7 = f M (p, R, cos (Р, s), cos (R, п)) dp (3.11) 
0 
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where 
М (9. R, cos (R, s), cos (В, n)) 


== — i К cos (В, n) Ver + qo + C N (p, В, cos (R, s)) (3.12) 


О БОЕ conn) 1 
N (p, R, cos (В, s) are ik - do (3.13) 


(а, 0, u,v», 9, =,7,6 are given real functions of ф corresponding to the theory 
presented in Section 2 ; г depends on ф, К. and cos (R,s) according to (3.9). 
Formulae (3.11) to (3.13) express the general form of Young’s principle. A further 
discussion of it is possible assuming Laurent’s series expansion 


uo? + vo t 5 Be 3 
lE - xe. } Sg 


Since the right hand side of (3.17) can be interpreted for real о as Taylor's 


and 


; : 1 cd 
expansion of some real function of x = —, all coefficients a, must be real. 


The series (3.14) is uniformly convergent outside of a sufficiently large circle about 
the origin in the complex plane of о. The radius of this circle ọọ is, іп. general, 
a function of 9. We shall assume for the following that the valne o —o(y) lies for every 9 
(ф)>в.(Ф) 
Substituting (3.10) and (3.14) into (3.8) we have 


2n 55) со 
ji ү eik(o +r) d 1 
icm = dpaßR cos (n, R) Ў = ао ——— (2 cos (3.15) 
0 Е ^ 


m 
ais 

й = СЕА y^ Е. К (3.16) 

r = VR? (o — o) 4 2R (o — o) cos (Е, 8). (3.17) 


For curring out the integration of (3.15) we use a generalization of 
Rubinowicz’ method (1917) (see also Sommerfeld 1950, р. 323). We have from (3.9) 


© = p — о + R cos (R, s) 


and 


(i+ %) =r+g—o-+ R cos (R, s) (3.18) 
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as well as 


d 1 UE (3.19) 
do [г + о—с + R cos (R, s)] т г? 


Using succesively the formula 


a [E | ( 2) eike+r)  neikle+tr) 
== =tk{1 + — — 
do о" do о" ont 
eikle +r) t d eik(o TD n d eik(e +r) 
e eS Р meos 5 [`+ 
Um zx a) +- yt =) 


n (n +1) d (een 
m aye Co) + 
(ik)? (1 + 


(n-+-m— 1! 1 d leike+n 

SIR (n — 1)! Gk) (eme (3.20) 

- et са 
йо 


we get 


From (3.18), (3.19) and (3.20) we have 


; - [2 (a- 14 Me: Dus 
4 = Ол? r е : [r + o —с + R cos (R, S] г do o" 


eik(o +r) d 1 
mL 2 [r +e —a + R cos (R, s)] j^ 


Т (n +т—1)! гт—1 d [etke+n 
= (n =D J (7-0 —04- Roos (R, s) do \ em J 
BONITA жы дё п-т — 1)! 
[1 + cos (R, s)] В? gn I Sean (n — 1)! GA)” ET 1)! (ik)m I (3.21) 


For the evaluation of integrals I, we use the method of successive integrations by parts. 
Applying once more (3.20), we get 


n eik(o+R) rm—3 eike +r) d 
Tm ~~ [I + соз (R, s)|" +R? ont eg a M E 
о—о + R cos (R, 8) gm 


TE [r о —о + Е соз (R, 3) "+1 отт ~ [1 + cos (R, 3) "+18? от+т 


A Generalization of the Young-Rubinowicz Principle in the Theory of Diffraction 87 


T у (n +m 4- 1—1)! efko +R) 2m 
(n + m — 1)! (iE) *! [1 + cos (А, 5) |" t1 R3 gini aae 1- cos (R, 5) 


ON: J- m -- L—1)! (n +m -- L4-v —1)! eik(o+R) 
(n Ет 1)! (n +m +1—1)! (ik) *v +2 [1 + cos (R, s)]nttvtiRign m toro 


x [aneta en 


6 m (l +m) 9 o 
[1 + cos (R, | ae (5-22) 


Substituting (3.21) and (3.22) into (3.15), we obtain finally 


2л 


1 „ eR cos(n,R) Sa, 1 = (п 4- m — 1)! 
, т=з сш а iko Mao. БА. 
D zi инь (Rcs) itm п! о" = T. 


0 


со 


| ml У (n +m + L— 1)! 1 
* П cos (R, s)]"o" n + m —1)! (i)?! [1 + cos (R, s)! Ro! 


ro 
M 2m (n +m +1 + vy —1)! 1 
E T 1 F cos (R, s) -Mi (п + m = +1 [1 + cos (В, 5)]° Ro? 
ah ee БИ 
x (v 1) (m 4-1 2) ea ar 
6m (l + т) 
"TTF соз (m, в? +s) | es 


The rest term S, gives for up expressions with R-4and higher powers of R ^. It can be 
succesively computed by the same method as above with an arbitrary degree of accu- 
racy. Because of the assumption (2.7) the expansion (3. 23) is reasonable and in any 
case it is an asymptotic expansion of up. Not going into discussion of the convergence 
problem, we shall discuss only some particular cases. Neglecting terms with negative 
powers of k, we get a *parageometric approximation" (a name proposed by Toraldo 
di Francia, 1950) for up: 


2л 


1 eR cos (п, В) = et 
uera ioe aa SS en mgr * 
LOC Ал “л. К l4-cos(R,s) = п! 0" en 
0 


For large k and not too small distances from the shadow boundary K (where the 
cos (n, R) 

1 + cos (R, s) 

approximation for ир + This formula is a simple generalization of Rubinowicz’ result _ 


expression has its singularity) formula (3.24) should give a very good 
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(1917, р. 262, formulae (5) and (5а)). Since a, = 1, we have from the comparison of 
our and Rubinowicz’ results 


В = sin (s, t), (3.25) 


where # is a unit vector tangential to the diffracting edge (3.25) can be also deduced 
directly form the geometrical meaning of h,, іп (3.7)). Decomposing (3.25) into 
three components, similarly as Rubinowicz (1917), we get 


(a) iae sin (s, £) dy represents „this part [of the whole light intensity] 


which the light source transmits to the edge element for scattering“ (Rubinowicz 1917, 


p. 263). 
dur 3 ae 
(b) Б is a spherical ,,elementary wave“ in the sense of the original formula- 


tion of Young’s principle. The edge point is the source point of this wave. 
со 
(с) Шон n 2 is a „direction factor“. We see that this factor de- 
1 + cos (R, в) 4 n! о" 

pends оп the screen edge and on the incident wave and its aberrations. For а perfect 
unaberrational wave a, = 0, n >1. For a wave with geometrical aberrations а, are, 
in general, different from zero. The coefficients a, are, therefore, a measure of ge- 
ometrical aberrations occuring at the given edge (they are functions of the edge pa- 
rameter ф). Only these aberrations are of importance for diffraction. It would be possi- 
ble to develop also the general theory of geometrical aberrations on the same basis. 
At first it would be possible to define a,-coefficients not only for p = 1, but also for 
0 <p — 1 and in such a manner to construct a theory of aberrations of the wave. 
In order to develop a theory of aberrations not for waves but for optical systems, 
it is necessary to determine the-a,-coefficients for arbitrary waves emerging from the 
given optical system. We shall not ‘develop further this idea here, we remark only that 
such a method would be very convenient in the diffraction theory of aberrations, since 
it would allow to treat the geometrical and the diffractional effects on the same level. 

If R> 1, higher powers of !/R (beginning with А-2) can be neglected in (3.23). 
Preserving the expansion in powers of 1/k in the term of (3.23) which is proportional 
to В 1, we get а more accurate formula for up as (3.24), valid for large distances from 
the screen (and also for not too small distances from the shadow boundary): 


2л со 
: dr ee +R ( SURE (n+m—1l1)! a, Rm 
"ES R RT REDS n! (п — 1)! (ik)m от" (В Is: R)" 
(3.26) 


(we use here a vector notation as in: Kottler 1923, Baker and Copson 1939, 1950, 
Rubinowiez 1953). 


To get a still better approximation, we may use the full formula (3.23), but this is 
advisable only in exceptional cases. The general structure of this formula may be pre- 


т = 0 
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sented as follows 


2л 
1 eikR 
‘oak | dppact She (3.27) 
0 


q=1 


where f, is "the direction factor of the q-th order". 


4. Conclusions 


Our result contained in formulae (3.11)—(3.13)and in its special cases (3.23), (3.24) 
and (3.26) is a direct generalization of Rubinowicz’ formulae and can be interpreted in 
the same way, i. е. as a generalized Young's principle. In the derivation of our result 
the assumption that the incident wave is geometrical was essential. Indeed, only if 
the notion of the geometrical ray can be applied, the shadow boundary K has an exact 
meaning. If the incident wave would be non-geometrical, the notion of the ray could 
not be applied, and the shadow boundary would diffuse into some domain which 
cannot be exactly determined. It seems, therefore, that Rubinowicz’ decomposition 
can be carried out only if the incident wave is a geometrical wave. In other words, 
it seems that our generalization is the most general one from all possible generaliza- 
tions of the Young-Rubinowicz principle. 

Presenting our result (3.23) in the form (3.21), we see that all direction factors f, 
have, ехаЙу as Rubinowicz’ factor 


cos (n. R) 
1 + cos (R, s) ` 


a singular direction given by 
cos (В, s) = — 1 — or R=—Rs (4.1) 


in which they are infinite. The direction (4.1) is identic with the ray direction at the 
given edge point. This singularity is necessarily connected with the jump of the in- 
tegral ир on the shadow boundary compensating the jump of ит (of opposite sign). 

Finally we shall discuss briefly van Kampen’s criticism (cf. Section 1) of the 
Maggi-Rubinowicz decomposition. Van Kampen (1948) uses in his paper a method of 
stationary phase aplied earlier to diffraction problems by Kirchhoff and especially 
by Rubinowicz (1924). Van Kampen finds by this method a point of the aperture 
through which an incident ray must pass in order to reach the observation point. 
He calls this point a critical point of the first kind. Analogously he finds a point of the 
diffracting edge in which such an "edge reflection” (Rubinowicz 1924) occurs that 
the “reflected гау” goes also through the observation point. Van Kampen calls this 
point a critical point of the second kind. Expanding the functions occurring in Kirch- 
hoff's + .egral in series containing the coefficients of geometrical aberrations, he gets 
some series expansions for the contributions coming from the vicinities of the critica, 
points of the first and of the second kind respectively. He calls the latter expression 
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“diffraction at the edge” and believes that it is not identic with Rubinowicz’ function 
up. The reason is that, after van Kampen, up contains also а contribution from the 
vicinity of the critical point of the first kind and therefore does not represent the 
proper “edge effect”. Van Kampen's reasoning has, however, no sharp meaning and 
cannot neither be taken as correct nor rejected. It should be treated rather as a mis- 
understanding. Indeed, the method of stationary phase, as well as the more general 
saddle point method, is a typical approximation method and cannot give an exact 
decomposition of u into two components. This fact is particularly clearly seen in the 
case when the critical points of the first and of the second kind lie very close together. 
Then it is altogether impossible to disentangle the contributions of the vicinities of 
the critical points of both kinds and to separate by this method the *edge effect" 
from the “surface method". Rubinowicz’ formula (3.2) gives, on the contrary, a wholly 
exact and uniquely determined decomposition even in this special case. Of course, 
if the critical points of different kinds are sufficiently distant, van Kampen's expres- 
sions give approximately the values of the two  Rubinowicz-components. Our 
generalization of the Maggi-Rubinowicz decomposition confirms particularly strongly 
Rubinowicz point of view, since our result is also valid for an aberrational incident 
wave discussed by van Kampen. 

We see, therefore, that on the one hand van Kampen's criticism of the Young- 
Rubinowicz principle cannot be maintained, and that on the other hand Nienhuis' 
intuitive conviction of the general validity of this principle, a conviction which was 
intuitive although induced by observation, is quite right. 


The author expresses his sincere thanks to Professor A. Rubinowicz for his kind 
interest in the present work and for valuable discussions. 


КРАТКОЕ СОДЕРЖАНИЕ 


Ингарден, Обобщение принципа Юнга-Рубиновича в теории диффракции 


Работа обощает метод Рубиновича разложания волны за экраном Кирхгоффа 
на ‚падающую” волну ири „диффракционную” волну ир для случая произвольной 
„геометрической” волны падающей на экран. Расчет произведен в приближении 
скалярной теории Кирхгоффа для экранов произвольной формы. 

Для падающей волны употреблялось приближенное решение Фока волнового 


уравнения (1950). Результатом для ир есть криволинейный интеграл содержающий 


2 1 1 
разложение в ряд по степениям 3 (К — волновое число) и 35 (В — расстояние от 


точки интегрирования до точки наблюдения) и дающий формулу Рубиновича как 
специальный случай. В ‚„парагеометрическом приближении” получено особенно 
простое обощение результата Рубиновича, являющееся рядом с коэффициентами 
определяющими абберации падающей волны вдоль дифракционного края, 
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RADIATION THEORY IN CRYSTALS 
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This paper contains the radiation theory for the atom in a crystalline medium. 

In the first part we obtain equations for that class of problems using the formalism of 

quantum electrodynamics. Ап apparent mass of the photon appears in that formalism. 

In the second part we define (with certain simplifying assumptions) the formula for 

. the apparent mass of the photon and solve the equations of the radiation field in a crystal. 

The solution obtained is consistent with Ewald's classical result. In the third part we 
give an estimate of the emission probability in crystals. 


1. Introduction 


We shall consider an atom interacting with radiation in a crystalline medium, 
i. e. a system composed of a radiating atom, the field of radiation A, (x) and the field 
of electrons y (х) forming the medium. We assume that the electrons move in a given 
(non quantized) periodic field 4% (x). The electrons can be bound to nuclei or not. 
The equations of motion of the considered system have in the Heisenberg picture 
the following form 


ОО SAC 
H C H H 


9 E on i ie : 
b) E (2. An e) =. Ar (x) = Terr Au (x) vo). (1) 
с) the equations of motion of the atom perturbed by the fields Ay (x) and Ar 
| 9 Aula) 
ne ВН = 0, E Wy = 0 
d) the до (x) = 0, Ox, H , 


where j,(x) — iec : (x): and j4 (x) are respectively currents of electrons of the 
H HEHE H 


medium (we take the antisymmetrized current) and of the atom. Operators - ых) 


(95) 
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y(x) have here the common commutation or anticommutation properties. The sym- 


H 
bol:(...) : denotes an ordered product where all annihilation operators are on the 
== of the creation operators [Wick 1950]. We use a notation in which x = (x,) = 


(x, ict), (y) = (—iBa, В), va) = v*() В. 


By means of the unitary transformation: 


0 (x) SONOS Ol Гај Ol ee 
H 
where 
4. 9S[c] A =. 
the et = — (а) Au) Slo, 51—99] = 1, 
we get-a new picture, in which the equations of motion take the form 
I: | 
a) O A, (x) = Ju): 
ы | 4 (= وھ‎ = j Ane) + 2 
Ун Ox, ће (0 ) + x v(x) = К^ p(x) p(x), 
c) the eqations of motion 2 the atom perturbed only by the field A(x), 
КООШ [6б| 9A,(a) 
d) the LU Ee 2% (x) A,(x) [o], TTE V [c] (x on о), 


where j,(x) = tec: VY p(x): Of course this is not an interaction picture. 
. We shall solye equations (2) using a unitary transformation leading to 


the picture, for which the forms of the electron and electromagnetic operators are 
known. 


The interaction picture has such properties but it is adapted to the case of photons 
in vacum. We shall use here instead a new picture, better fitted for the description of 
photons in a material medium. It can be expected that in a material medium photons 
acquire (from a macroscopic point of view) an apparent mass depending on their 
direction, frequency, and on the structure of the medium. We shall determine this 
mass later. Now we introduce it formally, using the following unitary transformation. 


0°(х) = U[ ] O(x) И-Ч ], V*[] = О ] yfo), (3) 
where 
ite SU a SUE E (4) 
and 


Нд = — G Pu) Аа) + > ий Aula) 4,9) 
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Nn 


Jul) = iec: p°y,p°(x):. ш is here a c-number equal to the apparent mass multiplied 
by с|һ. The equations of motion in the new picture have the form 


a) (D — u?) 4°, (x) = 0, 


Pure | 
» |» (=. = о) -r “| y(x) = 0, 


9 
c) the equations of motion of the atom perturbed by the field Ах), S 


© 


d) ite SH) _ E О @ + @ Ale) +5 wt А) @) A | [о], 
с(х) 2 
The operator A, (x) satisfies here the following commutation rules: 

[А (xs (=) = сд, A (x — х), (6) 
where A (x —x) is an ordinary Jordan-Pauli function for mass u. Our new picture 
becomes the interaction picture for и = 0. Note that, independently of the value of u, 
the operator y^(x) is allways identical with the electron operator in the interaction 
picture. 

The relation between the electromagnetic tensor and vector does not change after 
the unitary transformations given by our operators S ([o] and U [o]. Therefore we have 
i CORN n — A. Thus we can gauge An 
Aj, = 0. We obtain it by replacing A, by Anz VEE on = А. We are enti- 
tled to do so because the first equation (5) does not change when (O — p?) x ^ 0 
(if the initial condition for x is appropriate). 

It follows from (3) and (4) [cf. Dyson 1949, Wick 1950] that 

+60 


06) = U [oe] V. a (- x) | йты TO О Но 
n=0 со 


Here the symbol T (...) is the chronological product as defined by Wick. Further 
calculations consist in splitting the chronological product T (...) into ordered products 
with suitable contractions which lead to the Feynman [1949] functions: 5% (x,x') and 
A (х — x’) [Wieck 1950]: 

po (ж) pola!) = SB а, x), Ana) Aria’) = heê, D (n — а) 
(the dot over operators repeated twice denotes contraction of both operators). These 
functions satisfy the following equations: 


д ) ех ех f j p 
(ео) ++] тел-н x), 


(DI — д5) Aq (v a) = 0 (x — х). 
Assuming the potential to be independent of time 4х) = A(x), we have: 
>. Vra (x) Vrs (х) @ 23 t^ 
es I: А —— у 1 
роя) = | „д ree) (< >), 
i r 


B 


° in different manners, especially so that 


and 


(8) 
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where y,(x) — и, (x) exp (— i E,t[h) are eigensolutions of Dirac equations with poten- 
tial A%*(x). The sums X+ extend over states of positive and negative energy respectively. 

Inserting 0°(х) = A, (x) in (7), we would obtain the perturbation solution for 
A(x) if we knew u. That would be true, e.g., if we used the interaction picture (и = 0). 
. When и is not known, we shall proceed in a way which will enable us to find it. 

Let us assume that и? is of the second order in the coupling constant. This asump- 
tion will be shown further to be consistent with the results. Inserting O^(x) — iQ 


in (7) and using Wick's method we have 
+оо 
іе? 


Jule) = U~*[>0] | с | dx’ [p° (x) VSE x’) y» v^ Qn): + 


— oo 


Be) pS", x) у ©: — tr Qr SE Qs) у, S (87, 2)] 4, @) = Ten 
Further terms in () brackets are of higher order in the coupling constant. 

Operator U[o] given by (4) describes processes in the medium (independent of 
rhe considered atom) resulting from interaction between electrons of the medium and 
the radiation field. Let us assume that in the medium processes of the first order do 
not occur. Then, for all states [0], we have 


Ujo] Чо] = Чо] 


(for terms up to the first order). After this assumption formula (9) takes the form 
+оо 


M iy 
jut) 50 == у: йх' [ p(x) Yu SE Qo) Yo PR): H: PUR) Ye SER) yu v^ G2 : 
+оо 
— tr (Yu S$ (% =) у» SE (о) A), (10) 
where we have taken into account terms up to the second order only. 

If we put (10) in the first equation (2), and insert afterwards the thus obtained 
A (x) in the fourth equation (2), the latter will describe processes following from inter- 
action between the atom and the radiation field in the medium. 

In order to find, e.g., the probability of one-quantum (spontaneous) emission we 
consider the transition from the initial state W; = V[—0o9] without photons in the 
field, to the state ¥, with one photon. Assuming an adiabatic switching on of interac- 
tion at the initial moment t = — oo and switching it off by the measurement at the 
final moment t, we obtain 


КЕ ACER : TEILS 
V. Rm sie p yr, Wi or T D. Vr, 
where W4, WR, wel are state vectors of the atom, radiation field, and electrons of the 


medium respectively. Then the probability amplitude of one-quantum emission inthe 
ime interval (—co, t) will be equal to: 


QUT = d da Qf jo) V» CP АА) YT >, (11) 
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where 
<A, (9? = QI AQ) FFD. (12) 
We assume also that the electrons of the medium are in equilibrium with the radiation 
field, i.e. that 
yr! C 7f. 

When the first and fourth equations (2) are solved the term DOS) in 
formula (10) gives the interaction of the electrons of the medium with the atom. 
In the first step of the method of successive approximation this term leads to diagram 
a. in Fig. 1. Using a suitable unitary transformation, we can eliminate this interaction 
from the equations of the radiation field, inserting it simultaneously in the Hamilto- 
nian of the atom. The eigenstates of the atom will then be only slightly modified. 
There appears namely (according to Bethe and others) a splitting of the energy levels 
depending on crystal structure and resulting from electrostatic interaction (a kind 
of Stark effect). References relating to that subject are given by Rubinowicz [1949]. 
We will not take into account the above modification of the states when comparing 
probabilities of emission in crystals and in vacuum. 


ra gu eia ® 
d el 
# 
A Р А А el A el A 
a. b. С. а. е. 
Fig. 1 


Three remaining terms in formula (10) give additional terms to the probabilities 
of transitions, e.g. to the probability of one-quantum transition. In the first step of the 
successive approximation method (formula(11)) they lead for that emission to diagrams 
c., d., e. of Fig. 1. Diagram e. describes the self-mass of the emitted photon in the 
medium. Besides, diagram b. will also appear. 

From the first equation (2) and formula (10) we get the following equation describ- 
ing in the considered approximation the radiation field in the given medium- 


оо 
С A, (x) E K yy (хх) А), | (13) 
where 2 
к) = E (PO va SE (o) ve) ET E v SF 60) Mo YA: 
We dropped here terms leading to diagrams a. and e. in Hig. (14) 


2. Ewald’s solution 


Let us assume that the electrons of the medium interacting with radiation are 
assembled in the vertices of a crystal lattice and that the latters can be considered as 
points compared to the radiation wave lengths. This assumption 1s used with good 
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—_ 


results in classical crystal optics (as well аз in X — ray optics — Ewald 1912, 1918). 
We shall solve equation (13) with that assumption and find и at the same time. 
The solution obtained will correspond with Ewald’s solution in clasical crystal optics. 

Inserting in the right hand of (13) a plane wave satisfying the first equation (5) 


A (x) = АЁ @) oF (+ + = -°) (15) 


and using formula (12), we obtain 
+00 
D AE (д> = f 4=< K, os =) AP eT, (16) 


where 
«Ки» )> c < VE KE GS x^ WP >. 
We will take into consideration the one-particle model for the electrons of the 
medium. Then the wave-function of electrons in the position representation 
ER „ay Uo s XN) = (X115 ۰.-9 N pel» 
is a product of one-electron wave-functions. 
rar (21,..., XN) = ee (%4).. eh (хм) 
(&,, ..., &y — spinor labels of N electrons). It can be proved then that 
СУ vy (а) фе (x) V?» = УР QS (x^) (17) 


N — number of medium electrons in state 4), where 


9 165 > e _ 
E (= Au ©) e] Vj (x) =0 


We expand the configuration one-electron wave function (in state ¥) in eigensolutions 


р(х) = u,(x) exp (—iE,t/h) of the Dirac equation with potential A®*(x): 


1 
Ра) = У ора). (18) 
= _ 
The coefficients с, give the initial distribution of electrons in the medium. With 
our former assumption of equilibrium this distribution is preserved constantly in the 
medium. 


Using succesively (17), (18) and (8), we find after some ln (cf. Rzewuski 
1952), that formula (16) takes the form 


DA)» = faye’ KEG, x n0) AEG CT, (19) 
where j 


+ 


0, — UWr, + © 


KEG Za) = 2 ур р, ЕЕ 
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U) aur (%))* са (20) 


| w O, TO 
here (æ) = (x, i), E, =ho,, and P, = |c,|? is the distribution function of electrons 
of the medium in state У over r states. The operator (A7 (x)? has been averaged above 
over the initial states (phases) of the electrons, using the assumption of a random 
distribution: 

с, Ci (orl Ory 
Moreover, in the above calculations, we have excluded succesive transitions through 
intermediate states r of the medium electron. These transitions are characterized by 
the energy conservation law in the intermediate states: 0, —o, + @ = 0. In reality 
these transitions do appear for resonance frequencies. In that case the Weisskopf- 
Wigner method must be used for calculations. In order to exclude from our consider- 
ations the case of resonance frequencies, we shall assume that the radiating atom 
differs from the medium atoms. 
In formula (15) operator 4," (x) has the form 


A (x) = SB Ud (21) 
^ ЗИ“ 


w? - 20560 v С 
where k? и? =e =1 (ЕЕ). (c,e,)), V = 12 is the volume 


Uh = 
. LI = LI * * 
of the normalization cube (Rek — I m, m — vector with integral numbers as com- 


ponents), and a* are photon absorption and emission operators respectively. The 
normalization factor appears in consequence of the commutation relation 


E (x, t), поа Е 


at 
We can now put equation (19) into the form 
| № 
0442 > = (p? (s B), 4, C0. (22) 
where | 
А E Ang a 
(ie Gp, = [are ket (3) 


describes microscopically in the considered approximation the apparent mass of the 
photon (without self-mass) in the given crystalline medium. Averaging over a crystal 
of volume G, we obtain 
e 1 Fani a 
(i? @„ = d | ax? Dir (24) 
G 

The coefficient и in formulae (5) is now so adjusted that A, (x) represents „mac- 

roscopically^ the plane wave propagating in the given medium, i. e. that 


, 


^ 040 = G20)» A: 
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Inserting here formulae (15) and (21), we obtain the expression for the macroscopic 
zt 


apparent mass of the photon u (without self-mass) 


= =з = 
и? TH eu (и? ()) €, 


Let us notice that we can write 


со? =, na 
з= — — и = — : 
M 5 


where » is а real number, k a complex number und 


2 


E } со: = 
па а = 1 eut p e. > 


Formula (25) determines in the considered approximation the macroscopic (complex) 


refractive index. For all directions of t propagation k and polarization e, there exists 


a certain value of the refractive index n TO е), which depends besides on the magnitude 
Re k = 2л|2 (double refraction and dispersion). 

We remind that we can without loss of generality gauge А, (x) so that A,(x) = 0. 
Then e, = 0, thus е2 = 1 and from the Lorentz condition ©7210: 

In further calculations we shall-assume (for simplicity) that the lattice is cubic. 
Generalizations do not lead to any essential difficulties. We shall denote by а the edge 
length of an elementary cell. 

We shall now use the assumption that electrons in the medium are assembled 


in vertices of the crystal lattice. From that assumption it follows that function K KE (xx ,;0) 


given by formula (20) has near the e lattice-points с ma(m — vector with integers as com- 


ponents) the character of 6 (x — а) and 6 (a = a), andin other points it is equal to zero: 


КҖ (х, x; w) ет ( SDK — а) 8) —a')e* й@-) fa «fon (x, x x 300), 
a ag 4 
where the integrals expand over regions Az and А > of atoms in the lattice-points a 
and a’. 
From formula (23) we obtain 


(u* (x, B), p» à (x — а)е+®@-@) [ dax ^ dy' KE(x,x';o). (23°) 


« 


Ита 


+ +. 
We can see that (u? (x, k))„ is a periodic function with period of the lattice (we assume 
here that the crystal has infinite extention and all atoms in the lattice-points are identi- 
cal). From (24) and (23’) we obtain further 


= 1 177 
б Oe == em» fas [а KEG e — eo 
ad ре 
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Now summing over a extends to all lattice-points of the domain G, over a’ to all lattice- 
points of the whole medium. 


To solve equation (22) we seek a solution of the form 


SF. ES 
2 -> 
CAS СУ = = Ax (a (x х)е ipe АЁ (x ) о IV 25 €, ettkx (i == =, en = 1),26 


where fE) is a periodic function with period of the lattice, i.e. 


+ > .» + 20> 
= = У 70е, = 2 т (27) 
a q 
(m — vector with integers as components). From (22) and (26) we obtain 


= 2 JS 
(^ + iY = x в) Е en Gee a (28) 


We shall solve equation (28) by applying Green’s theorem (cf. Born 1933) to the domain 
g of the elementary cell of the crystal 


| 
f 


g 

substituting there u — exp (—iqx), v — = f*(x). We obtain then (in consequence | of 

he fact that the surface integral vanishes due to the periodicits of the function f +(x): 
1 


ГЕО) = er | (X E) wer et (29) 

g 
where К° = oc, g = a3. We can replace the integral (1/6) 1] d4x in formula (29) by 
(1/G) Ја, where С is a domain of the crystal containing r4 integral number of ele- 
d cells (this is allowed, рс, the function under the integral sign is periodic). 


From the fact that function и? (х), has the character of 6 (x а) near the 


EIL. a = ma and vanishes elsewhere in obtain 
wal 3E 3*€u (u (x, É)) were “ie = ج‎ С?З! на | лат (x, Ё)) „е, 


n al de (x, й) wer as о (Bue, = zi (1 M 13), 


Гог ga = a mam! = 2zmm', and exp (—iqa) = 1. Finally from (26,) (27), and (29) 
a 


we get P. 
E hc + TOS И 
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This solution corresponds with Ewald's solution (1912) from classical crystal optics. 


The function 
ейх ‘ho + 27 + 
چ‎ е 31 
2 (j 4 — № ( 3 i) D 
q 


can be represented (Ewald 1912) as a superposition of spherical waves outgoing from 
the lattice-points 


ik |x—a] > > = 
S NY + ; (а = T , 
— 628; = am == a). 21 
4л 2 lx —a| 0) 


To obtain the convergence of these series (in points x а) we must assume (the medium 
is infinite) that Im k° [m k|, and after summing up we must find the limit for 
Im k? — 0, as k° = о |с is a real number. 


3. The emission in a crystal 


We shall now investigate the probability of (spontaneous) one-quantum emission 
in a crystal. 
The electrons of the radiating atom will be treated configurationally. We have 


|; 


— 


| Јна Жог > Se 
ip (x) = eh dis (x)e % A JA (x) = ec а Ô (x — x) (32) 
(H^ is here the Hamiltonian of the radiating atom, Z — the number of electrons in the 
atom interacting with radiation). 
Next we gauge A (x) so that 44(x) = 0. Then e, = 0, and from (30) we get 
From (11), (32) and (26) we obtain (for a flat hypersurface c) 


2 
d cm (og 0a +9): .] 
VA) = 2. yr =з а, 
СРР) = = В| Е а 
where A1 (x) is obtained. from Te (x) defined by formula (30) by dropping the emis- 
sion operator a^, and |A> = Y4|B> = YA are the initial and final states of the 
radiating atom with energies E, =hw, and Eg = hw, respectively. 
The probability per unit time and unit solid angle of photon emission (in the 


transition 4 > B) in the direction (3, y) and with the polarization e is given by the 
well known formula 


w (8,98) = = 7 (‹ TOX TOD] ee (83) 
um k) _ V(Re k)? d(Rek) 


in which : cew) = o (Rek) TUS ERU 


Ы 
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ne w? — — 
tb e 2 
and k = Eus quc s Ore: Here the apparent mass of the photon 


и? =e (n?) „е, depends slightly опо, because we have excluded from our considerations 
the resonance frequencies c = |w,—©,| (cf. (20)). Therefore we have approximately: 


dk|de = c[c*k = 1]пс, and 
(Ren)? Го? 


о (о) = ne Ga)? Е (34) 
From (33), (34) and (30) we obtain 
w (9, ф, е) 
А-В l 
(Ке n)?o Z l2 а ea Êz i 
=o Reel cay ae) ur (35) 
vat = 
C 2=0 4-%B 
no 


where & = е?|Алћс is the fine-structure constant and k = —- 
c 


If the radiating atom is placed in one of the lattice-points of the crystal, e.g. 
in the point a = 0, then in equation (35) we must replace function (31) (which is 
equal to (31) with the lower signs) by the function 

g eile a] u 

— ре!" 36 

4л سے‎ |х — ај $ (36) 
a+0 


of the radiation field externally interacting with the atom. This function is equal to 


У ах 2 1 eia x 

А ——4— Ohren e 

= (q + k)? — [°2 4л 31 (q + hk)? — °2 (36) 
4 


: : * : В ж су: н. 
in the integral 9 changes continuously, in the series q = — m J. In the above inte- 
a ; 


gral the path of integration in the complex plane passes over the pole —k° and under 
the pole К°. 
Let us observe that this integral does not depend on К. Thus the divergence of 


this integral at the point x = 0 is compensated by the divergence of the series at the 


> 


point =0 independently of the value of k. We shall use this fact when estimating 
the probability of emission. | 
If the wave length A = 2n/Re k is greater than the double edge length of the 
crystal cell a, then in functions (31) or (36°) the term with m = 0 is much greater than 
the following ones and we can approximate these functions by the expression 
etikx 
; Ze) 
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Equation (35) takes now form 


2 
4 (R ec Ы » 
DIOE Ф, е) =“ E = In]? aaz Уа FEE © € КЕ Кез ; (207 


Formula (35’) differs from the corresponding emission formula in vacuum only by 
the fact that in (35°) the refractive index n appears in the coefficient and in the exponent. 
The difference in the exponent will not appear for electric dipole radiation (where we 
put exp (— ike) = 1) but only for magnetic dipole and electric quadripole radiation 
(where we put exp (іка) = 1 — ikx). The probability of emission for electric dipole 
radiation will be multiplied in comparison to the probability in vacum by the factor 
(Re n)?|n|?, for magnetic dipole and electric quadripole radiation — by the factor 
(Re п)?. 

If the wave length A is smaller than the double edge length of the elementary 
cell (but greater according to our assumption than the domain of the lattice where elec- 
trons interacting with radiation are assembled), then besides the term with т = 0, 


then terms for which the value of q is near to -F 2k (and А = 2a |т)) will play ап 


: JANE. те, = Der 
important part. For simplicity let us assume that Rek has the form — m,(m, — a ve- 
a 


ctor with integers as components). Then, using only terms with m =0 and m = 
F 2m,, we approximate the function (31) or (36°) by 


ех | eFikx 
2 
w 
in) 
g^ 


Formula (35) will then take the form 


Ке п)Зо› 1 »‏ ج 
PAM ф,е) =a б: po «Bl у q;e(e 7 ix; 4 A), (35”)‏ 


= 1 ("A 


- 2n > ; 
where Rek — TQ Thus we can see that for electric dipole radiation the proba- 


n Fits ; - 2л - 
bility of emission of a photon with the wave vector Вей = — m, distinctly increa- 
a 


ses in comparison with vacuum (4(Re n)? п? times in our estimation), whereas for 
magnetic dipole and alectric quadripole radiation the probability of emission 
strongly decreases (it is equal to zero in our estimation). For other wave vectors 


Re k the approximate value of the emission probabilty i is contained between the values 


(85) and (35”). 
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4. Final Remarks 


The considerations contained in this paper can be transposed to the case of 
a radiating nucleus in the crystalline medium, if only the wave lengths of the у -ra- 
diation are greater than the dimensions of the nuclei of the medium, for only then 
the nuclei can be treated as points. But then we must neglect the interaction between у 
-radiation and electrons of the medium, because the clouds of electrons diffuse the 
points of the crystal lattice. 


This paper arose from the initiative of Professor A. Soltan, who was 
interested im the question of the influence of the crystalline medium on 
emission probability. 

The author is also indebted to Professors W. Rubinowicz and Г. 
Infeld, and іо Dr M. Günther for valuable discussions and their kind in- 
terest in his work. 


КРАТКОЕ СОДЕРЖАНИЕ 


Круликовский, Теория излучения в кристалле 


Настоящая работа содержит теорию излучения атома погруженного в кри- 
сталлической среде. В первой части автор выводит уравнения рассматриваемого 
znacca проблем из формализма квантовой электродинамики. Появляется при этом 
в формализме кажущаяся масса фотона. Во второй части автор рассматривает, 
при некоторых упрощающих предположениях, определение кажущейся массы 
фотона и решение уравнений поля излучения в кристалле. Полученное решение 
согласуется с классическим результатом Эвальда. Третья часть содержит оценку 


вероятности эмиссии излучения в кристалле. Для длин волны вида A= 2a|m| 


(a — длина ребра элементарной ячейки, m — вектор, которого компоненты — це- 
лые числа) возрастает вероятность эмиссии для дипольного электрического излу- 
чения, а уменьшается — для дипольного магнитного и квадрупольного электри- 
ческого. 
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The generalized bilocal wave equations restricted by reciprocally invariant supple- 
mentary conditions describe families of particles with various spin and mass values. 
In contradistinction to a previous supposition no direct connection with the isotopic spin 
theory exists. А close connection with Fierz's theory of higher spins is shown, The ques- 
tion of interaction is briefly discussed. 


1. Introduction 


In a previous paper (Rayski 1953) denoted hereafter as (I) a bilocal generaliza- 
tion of the Schródinger-Gordon and of the Dirac equations has been proposed that 
leads to a discrete mass spectrum of the elementary particles, provided some reci- 
procally invariant supplementary conditions are introduced. It is supposed that the 
generalized Dirac equation may be used for description of the nucleon family while 
the generalized Schródinger-Gordon equation (with a pseudoscalar wave function) 
may take account of the pion family. > 
In the present paper the generalized Schrödinger-Gordon equation is investi- 


gated in some more detail. This may be written in the form 


(р? + akî + х?) y (x, r) = 0 (1.1) 
where а? is a dimensionless constant (denoted in (I) by 4C), x,, 7, denote Yukawa’s 


variables, and 
pics k ЕЕ Е Arc (1.2) 


In order to get a discrete mass spectrum the 8-dimensional bilocal manifold of the 

variables х, 7, must be limited by means of suitable supplementary conditions. As 

shown in (I) it is highly probable that these conditions possess invariance properties 

characteristic for the bilocal geometry, i. е. besides the postulate of Lorentz inva- 
(107) 
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riance they satisfy Born’s (1949) postulate of reciprocal invariance. In terms of Yukawa’s 


variables the reciprocal transformation is 

4-1; 1 Ld xl 

Ar, > Apy Ар, > — Аг, (1.3) 
Where A is а constant of dimension of length". The most general supplementary con- 


ditions invariant under (1.3), and scalar with respect to the Lorentz transformation, 


are 
(n +p) = by, (т, — рф = by, (1.4) 
or equivalent 
гру = 0, (ra + pip = by. (1.4) 
In (1) it was assumed simply that b = 0, now we shall put more generally 
b>0. (1.5) 
2. The problem of constraints 
Special solutions of equation (1.1) are plane waves: 
y(x, т) = eup ( р, г) (2.1) 
where the four-vector p, is to play, as usually, the role of momentum and energy of 


particles associated with the field variable y. The general solution may be represented 
as a Fourier integral 


wr) = f dpeéhewp(p, 7) (2.2) 
where y(p,r) satisfies a differential equation with respect to the internal viariables 
r, namely 

(k2 + pê + #3 p (pr) =0 23) 
and the algebraic supplementary conditions 

ры (pst) = 0, (r + pf) y (р, т) = by(p.?). (2.4) 


These conditions mean constraints restricting the function % (р, г), which has to van- 
ish for all points 7, except 

ие Tate Dim De. (2.5) 
The manifold of points in the л, — space restricted by (2.5) is a twodimensional mani- 


fold; in particular for p = 0 it is the surface of a sphere in the r- -space with a ra- 
dius r given by r? = b — ра. 


We may take account of the constraints (2.5) explicitly, i. e. investigate, in- 


1 Later on a system of units will be used where 4 = c = Д = 1, so that every physical quantity 
may be expressed as dimensionless. | 
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stead of y (p, г), a new function g (р, г) depending only on the two degrees of freedom 
in the r-space. In particular, for р = 0 the function g depends only on the angles 
9, Ф оп a sphere with radius Vb — p3. Obviously, the new function Ф (p, г) will not 
satisfy the original equation (2.3) but a new equation wherein those infinitesimal 
translation operators №, that are inconsistent with the constraints have to drop out. 


In this way we get a modified wave equation which may be interpreted as the result 
of appearance of the reaction forces К 


(ak + ри + +?) Ф (pr) = Ке(р, т). (2.6) 
R assumes a particularly simple form in the case p zx): 
R=a%(k2 +k) for p =0. (2.7) 
In consequence of 
dle ce ato 
2 = — — — — 
SES (5 3 r 2) 2 


equation (2.6) reduces to 


р х 
(5 s+ ام‎ + (۶ =0 гр =0, 74 =0, 7* =Ь—р, (29) 


r“ 
IET д 1 A 
= — 2 A ل س‎ = 5, . 2.10 
Ар C55; ine ere aus [240 


The relativistic generalization of the operator (2.10) is cf. e. g. O. Hara et al. (1954) 


where 


St = oh (2.11) 
Ope 
where 
Vay» = Pa Rw + Py Вл T Pr Faw (2.11’) 
and 
ПЕ Tip Be ate By | LED 


The relativistic generalization of equation (2.9) is 


2 
(5 8° E pu 2 ф(р,)= 0 for rupu = 0, 1y = b — Pa (2.12) 
Tu 
where S? is given by (2.11). S? commutes with 1/72 which secures the Hermitian char- 
acter of the operator acting upon ф in (2.12). "às 
We may replace in (2.12) the eigenvalue р, by the operator p, while py appear- 
ing in the denominator of (2.11) may be eliminated with the aid of the second condi- 
tion (2.5) Ё | 
2 
An cor Aw (2.13) 
j ау 
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so that 


2 
(x: Р E SE е) p(x, т) = 0. (2.14) 


In this case, however, the constraints assume an operator from 
= SR уу 2 2 
r, Py = 0, m= b р, (2.15) 


with а symbolic meaning. 

The operator S? may be written explicitly, which yields a lengthy expression 
that simplifies considerably when taking into account the first of the conditions 
(2.4). Putting all the r,s to the left, we get 


Т 
s? = СЕ k, TR zi s). (2.16) 


This last expression is simpler than the original nes but its Hermitian character is 
no more evident. Introducing (2.16) into (2.12) we have explicitly 


1 2i 1 
e [morte + Fa rutu (рива Fp + ven =0 en 
ГА TA Pa 
from where the term R representing the reaction forces may be inferred immediately. 


3. The solution of the wave equation and its interpretation 


The solution of the generalized Schrédinger-Gordon equation may be found 
easily. Since the eigenvalues of the operator S? are / (J + 1), equation (2.12) yields 


the algebraic relation 
2 


— b+ D+ pat 0 (3.1) 
b= py 
and the square of the rest mass M? = — р? follows 
2 
eae (>) оО оо (3.2) 


The negative sign of the root cannot be realized since for b >0 it yields imaginary 
values of the rest masses. The solution of the generalized Schrédinger-Gordon equa- 
tion (2.14) is 


р) = У Je Фар ei 5(p2 + Mp) ay (р, SEN p) S (p) = (83) 
Im 


where S,,, (p) is the spherical harmonics (in the r-space) transformed from the rest 
system to the system of coordinates in which the momentum is p. In consequence 


of the appearance of a delta function, the coefficients аы depend (besides on р) only 
on the sign of ру. In a classical theory aj, are arbitrary numerical coefficients while 
in a quantum theory they are suitable matrices. 


Ш d 
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With regard to the role played by the operator S?, the present formalism may 
be said to represent a rotator model of the elementary particles. The states of the par- 
ticles are represented (in their respective systems of rest) by spherical harmonics 
on a sphere with radius Vb 4- МА where М, is the rest mass of the particle. The state 
of rest of a particle with the mass М; is degenerated of degree 21 + 1. The 27 + 1 
states denote different states of "polarization" of the particle. 

Integrating with respect to pg and denoting 

Am (р, sp) ы, (Р), (3.4) 
аи (Ps —) = by (р), 
we get 


1 ip, (Dx = Fh 7 ха”. ЕУ, Ré 
в? = 2, f EN (ен а, (р) Sy, (D) + ei Bin (— p) Sm (—p)) 
l,m 
(3.5) 


where 


P() =p, poll) = Vp? + Mr. (3.6) 
Introducing a box of periodicity in the х-зрасе, one may represent the solution 
by means of Fourier series instead of Fourier integrals with the aid of the substitu- 
tions 


1 1 Be А 
m Je HL. Fg > = 6n 
pi sr бф 


which is more convenient for interpretation 


1 > а s К > = 

p (г) = M 2 (eibi. A alpi) Sim (pj) + eO» Bim (— Pi) Si( — р). 

lm fi (3.8) 

Since the generalized wave equation is invariant under space and time transla- 

tions (the internal variables r, are left unchanged by translations) there must exist 

constants of motion in the form of a four-vector with the meaning of momentum 

and energy. These are simple generalizations of the well-known expressions of the 

usual formalism. Since the element of integration over the independent variables in 
the r-space is 

4% (rp) ó(r? — b + p°), (3.9) 


we may easily guess a possible form of the momentum operator, namely 


P = — fdv | dex $* (от) 180) д  —5 + РУ grad, # (2) 
VM (3.10) 


- [д (rp) à (2 — b + p?) grad, p* (x, 7)] Pr). 


` Introducing (3.8) and taking account of the orthogonality of the spherical harmonics 


- / der à (гр) д(т*—Ь + P?) Sim (Р) 5 (p) = б дит» (3.11) 
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we get 
Р У У pi (Aim (Ps) Am (2) Bin (Po) Bin (PO) (3.12) 
М lm fj 
Ban e 1, CP) Ав, (p) + Bm (p) B Bi, (p) (3.12^) 
lm Pi 


A possible form of the energy operator is 
Е=+ [ar [ам {(#*(х, г) [ö(rp) 6? — b + p?) Ф(х, 7)] 
y 


+ [6(rp)ó(r? — b + p?)9* (x, 7)] Ф(х, r) 

+ grad, g* (x, r)[8(rp)8(r? — b + р?) grad, p(x, 7)] 

+ [9(7) (72 — b + p?) grads g* (x, r)] grad, p(x, т) 

+ g* (x, т) [ó(rp) 9(r? — b + p?) M(x, 7)] 

+ [9(rp) 802 — b + р?) м?р“ (x, 7)] p 7)) (3.13) 
where the mass operator M? is from (2.12) 


giu 
м? = 5° T. (3.14) 
Tp 
Introducing (3.8) into (3.13) and taking account of (3.11), we get 
B= ХУ EMP Ain ALI) + ВЫ РӘ BigP). 815) 


lm Pi 
The comparison of (3.15) with (3.12’) shows that P and iE form a four-vector. Its 
components are evidently time independent so that we have succeeded in con- 
structing the most important constants of motion. Similarly an operator may be con- 
structed which may be expressed in the form 


=e УУ anto) Ар) — Bio) Bm Po) (8.16) 
lm bi 
with the meaning of the total charge of the system. In consequence of a complete 
similarity with the usual field theory, the quantization will consist in replacing the 
coefficients A,,, ( р) and Bry (р) by annihilation (and their conjugate by creation) орега- 
tors so that 
(р) = Ар (р) Ар). Nap) = Bip) Bap) (3.17) 
assume integer values with the meaning of the occupation numbers of particles 
with momentum р, rest mass M,, energy ү р? + М НЕ charge + e, and "polarization" 
given by m. 
In this way the generalized Schrédinger-Gordon equation is ВА and the 
momentum representation is constructed explicitly. 


in the form 


Families of Elementary Particles 113 


4. Connection with the theory of Fierz 


Fierz (1950) has shown that Yukawa’s (1950) non-local field theory is equiva- 
lent to the theory of fields with higher spins (Fierz 1939), at least in the interaction 
free case. Now, our formalism is closely related to Yukawa’s and therefore also to 
Fierz’s theory or fields with higher spins. It can be shown that the particular solu- 
tions р(х, г) appearing in (3.5) are associated with particles with spin J while the 
quantum number m denotes the projection of the spin on a preferred axis. 

To show this let us solve the equation (2.12) in an explicitly covariant way. Since 
the solution of equation (2.12) is to be a pseudoscalar dependent on p, and r,, we may 
consider a sequence 


1 1 1 
AL) , т) ا‎ Ги» = y ) poe 4.1 
Pol P> г) VN, (р), qx(p.7) VA, AP) Tu» Фа VN, ay ( p)rur (4.1) 


where а, (p) are pseudotensors of the rank 0, 1, 2... Suitable normalization con- 
stants are denoted by Nj. It may be easily shown that, under suitable subsidiary condi- 
tions, (4.1) represents a sequence of solutions of (2.12) corresponding reprectively 
to the mass eigenvalues Mo, Ми, etc. In consequence of the constraints the coefficients 
а... (Р) will be independent of P (which is a constant), moreover, they cannot be 
quite independent from each other. The coefficient of rank 0 is of course independ- 
ent, but only three of the four coefficients a, of rank 1 can be independent. This 
follows from the condition p, 7, = 0. Dividing a,(p) into a longitudinal part Cp, and 
an orthogonal part, we see that the longitudinal part must disappear from g, and 
there remains only the orthogonal part. Thus, we have a subsidiary condition 


p, a, (p) = 0. ` (4.2) 


Going over to a,, (p), one can see immediately that only a symmetrical pseudo- 


_ tensor comes into account: 


a, (p) TH a, (р). (4.3) 
Four of the ten components of this tensor drop out on account of the constraint рт, = 0 


P. „а„(Р) =P аР) = 0. ч (44) 
Finally, on account of the second condition r2 = const, the pseudotensor a,, (р) 
must contain no term of the type C ô,» since this yields a constant a = Cr) which was 
already taken into account as фо = а. This last subsidiary condition may be written 


а„(р) = 0. | (4.5) 


The conditions (4.3) — (4.5) reduce the number of independent a,,(p) to five. Thus, 
in consequence of the subsidiary conditions which may be written generally as 


# Q., и... hje c Ч, My von Hie , (4.6 ) 


+ 


Ри; Чин Os 14.67) 
m 0, (4.6''") 


The separate terms of the sequence (4.1) contain exactly the same number of inde: 
pendent coefficients a,, as spherical harmonics. Introducing the separate terms 
(4.1) into (2.12) and taking account of (4.6) it may be verified that they satisfy (2.12) 
respectively to the values Mo, М,,... given by formula (3.2). 

These solutions are orthogonal 


[dtr d(rp) ô (è —b + p?) plp r) pp (pr) =0 for LAT (4.7) 


which is immediately seen in the case of J, l’ of different parity. In the case of equal 
parity (4.7) follows from the subsidiary conditions (4,6). Let us consider e. g. the 
case 1 = 2, Г = 0. We have to compute in this case 


а„»(р) а (р) ТУ (4.8) 
where 
Tun. = {dtr 6 (rp) ò (r? —b +p) ry, т, (4.9) 
is a tensor dependent on p,. In particular I» must be of the form 
lig api OO (4.10) 


Introducing (4.10) into (4.8) we get zero in consequence of (4.4) and (4.5). An analo- 
gous proof? applies to higher values of the indices /, Г. 
By choosing suitably N,, we may normalize the solutions so that 


1 
Hf d'ró(pr) òlr? — b + р?) gf (p,r) = $n is oes (4.11) 
In particular, 
АЙ (4.12) 


where I is a special case of (4.9). N, is found from (4.11) together with (4.10) 
and (4.2) 


1 
N, аа, = Auty (4.13) 
or 
e 1 I 1 vt 
1 2. 9 bi p» Pulp, |. ` (4.14) 
Similarly N, follows from 
1 1 
N, Яр Goo J РОДЕ 3 w а» (4.15) 


Ux xu у eee MEM ТИ" 


* A general proof follows from the theory of group representations. 
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where pue must be of the form . 


Is ESL D. DLP. рер (Pu Р» 0 рр чт) + C (0% Ont One Die Ou 0M) 
so that (4.16) 

Na = Ao (4.17) 
where c may be found by evaluating (4.9). Similarly the normalizing factors of higher 
order terms may be determined. They are multiples of the coefficients of the term 
containing merely Kronecker symbols. 


The sequence (4.1) forms a complete set. Writing the general solution as a Fou- 
rier integral 


1 
ol, т) 3d far کم‎ ôl p? ам? ) УМ, Чи... му Тш Ги... Ти (4.18) 
1 


it is seen that the coefficients depend (besides on p) only upon sgn pọ. As a matter 
of fact we have again two independent sets of solutions with 


Е у 
Be "m, is ANS 


The analogy with the solution discussed in Section 3 is complete except for the 


(4.19) 


normalizing condition. Therefore we get analogous expressions for the momentum, 
energy, and charge, in particular 


Q = УУ OG) =e УУ Jy Acar Ann) В. PO Bui O) 

mà = (4.20) 

The factor !/1! is to secure that (owing to the symmetry of pairs of indices) equivalent 
expressions appéar in (4.20) without unnecessary multiplicity. 

In spite of the similarity of (4.20) with the usual expression for the charge we are 
not allowed to replace the coefficients 4,, and Ain simply by annihilation and. 
creation operators since they are not independent. Let us consider e. g. the term (0 
in the case p = 0. Owing to (4.6’) or (4.4), we have in this case 4,4 = Ay, = 0. After 


. eliminating these coefficients, we are left with 
0, (0) = 5 CTE НР ре на. (4.21) 


The terms with two different indices may be replaced by annihilation and creation 
operators and denoted by 

Aiz = رو4‎ = Аз Аз = Азу = Ag А» = Аз» = А}, (4.22) 
on the other hand, the remaining terms with equal indices are not yet independent 
in consequence of (4.5). Introducing A, and A, by 


Ay Ass = A, =; As (4.23) 


1 2 
үз фсе Аат уз 3 


Ay = 
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we see that (4.5) is satisfied and Q, reduces to the standard form 
5 
О) =e У (4; А;— В; В). (4.24) 
i=1 
Thus, A, ..., 45 ара their conjugate are annihilation and creation operators, but not 
the original А. 
The above discussed form of the solution is directly related to the theory of 
Fierz. We have seen that energy, momentum and charge appear as additive terms of 
a set of fields whose Fourier coefficients are 


A (PO. o) (4.25) 


in the cases of Fourier series and Fourier integrals respectively. Thus, we may consider 
the equivalent set of fields 


Pray) = Јаре?» д (ра МР аи, (р) (4.26) 


satisfying the subsidiary conditions: (i) symmetry with respect to every pair of indi- 
ces, (ii) vanishing of traces, and (iii) vanishing of divergences: 

Ea (x) =0 (4.27) 

Әх, Фи... о... . . 
Fierz (1939) has shown how to construct for such fields the momentum and energy 
density tensor, the angular momentum density tensor, and the charge and current 
density four-vector satisfying the continuity equations. From the expression for the 
angular momentum it follows that the field (4.26) describes particles with spin l. 

In consequence of such an equivalence it follows that our formalism describes 
a family of particles not only with different masses M, but also with different spins 
denoted by the same index l. 

Arranging the particles with different spins into a family with prescribed masses, 
our formalism differs from that of Fierz where particles with higher spins are jntro- 
duced entirely ad hoc and do not yield any connection between masses of related par- 
ticles whatsoever. 


5. Outlook 


(a) Other families. Until now we have concentrated our attention upon the 
generalized Schrédinger-Gordon equation with a pseudoscalar wave function inclu- 
ding automatically all higher pseudotensors corresponding to higher integer spin 
values. The next possibility is a Schrédinger-Gordon equation with a scalar wave 
function which automatically includes all higher tensor fields. Another possibility is 
given by a generalized Dirac aquation 


ә ә | 
(к. ME +») р (x, г) =0 T | (5.1) 
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where а, and Û, are two sets of Dirac matrices. The two sets may either commute or 
be identical. We have still further possibilities, e. g. we may assume that the as 
are Dirac matrices while the 8's are Duffin-Kemmer matrices composed in a redu- 
ced form, or interchange the roles of the a's with the f's, or assume that both 
sets of matrices are Duffin-Kemmer matrices. Each case corresponds to a family 
of particles with a given mass. spectrum and integer or half integer spins as- 
cending either by integer or half integer values. The constants a and x may assume 
different values for different families. 

(b) The problem of interaction Às yet we have considered only free fields. 
With regard to the question of interaction there are two distinct problems: (i) the in- 
teraction between different families, and (ii) the interaction within a family. The latter 
may be asecondary effect through intermediary of other fields not belonging to the 
family, or we may assume a direct interaction by adding non-linear terms into the 
wave equation (as suggested by Heisenberg 1953 or Hara at al. 1954). 

The very existence of an internal structure provided by the variables r, makes 
it plausible that the interaction between two families (perhaps with the exception of 
the interaction with the electromagnetic field) is of a non-local character. Let us 
consider an interaction between a complex scalar field y (x, г) and a real scalar field 
U(x) assumed as local for the sake of simplicity. As, from a broader point of view, 
the non-local field quantities may be regarded as matrices 


, Lf 


«хх > where ЖЫЛ 


асв = х, x! STR : (5.2) 


a plausible assumption for the interaction term Wof the action functional is 
Ay x" 
f [aw fe Јасин u( t ) (5.3) 


f fa. Јах" Јак" lv*| ай > ВА’) <=" In x). (5.4) 
Alternatively, we may regard the density 
o (xr) = vt env = Gral» 68. 


or 


as a matrix and try the assumption 


p | di [| dia (UG!) at lel» + Cx eli DU): (5.6) 


In any case we have to do with a non-local interaction smeared out, in this way or 
other, over the surface of a sphere representing the particle. In consequence of the 
fact that the radius of the sphere increases linearly with the mass constant, the smear- 
ing out is more efficient, and the interaction weaker, for higher mass values. This 
makes understandable the comparatively long life time of the heavier particles. 
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(с) The mass spectra, The values of the rest masses of a family are determined by 
the constants a, b, and x. Assuming provisionally that b is negligeable, we get in 
the case of the Schrédinger-Gordon equation from (3.2) 


2 xi y a, ЧАКЫ auis 


Assuming that this spectrum applies to the pion family, we determine x and a from 
the two lowest mass values 


Mo = x = 273 т, M, = 973 т, (5.8) 
whence the higher mass values follow 
M, = 1270 m,, М; = 1506 m,, etc— (5.8’) 


The existence of particles with about the same masses seems to find confirmation in 
the experimental results of Fowler (1953), Perkins (1953), and others. 
Starting with equation (5.1), i. е. assuming a mass operator of the form 


д 
M-ap +y 6.10) 


where f, are Dirac matrices, we get the following mass spectrum 


V | Ton 
(ren и 


where the two values 4- correspond to the same spin index j. Assuming tentatively 


|М, = . 6.11) 


that (5.11) describes the nucleon family, we determine the two constants from 
M,- = 1839 m,, Му + = 2182 m (5.12) 


and find an interesting relation between the values of ‘the constants a for the pion 
(a) and the nucleon (ay) family 
2 
ау, u 10 
( 4 (5.13) 


where 4 and 10 remind us of the number of rows (columns) of Dirac and of Kemmer 
matrices. The higher mass values follow: 


M, = 2670 m,, М, = 3010 m,, etc— (5.14) 


The value 2670m, finds the best confirmation in the results of King at al. (1953). 
On the other hand, the lack of a mass value of about 2300m , (reported by several 
authors) should be noticed. 

The above tentative determination of the mass eigenvalues should not be taken 
too seriously: first of all the assumption b = 0, though the simplest, is still arbitrary, 
then there remains the question of x. It seems plausible that the „Баге“ particles possess 
à proper mass determined exclusively by the operator 5* (or by a corresponding oper- 
ator in the case of half integer spin) without any additional constant x. Most probably 


Families of Elementary Particles 119 


the constant x characterizes the ,,dressed“ particle, i. е. is a self-mass brought about in 
consequence of interaction. Now, the self-mass can depend upon the the total mass, 
i.e. can be a function of the spin index 1 or j. It is only a rough approximation to assume 
for x a value that is valid only for the lowest member of the family. Nevertheless, in 
this way we get at least a qualitative agreement with the experimental evidence and a cor- 
rect order of magnitude of the expected mass terms. In particular, we obtained а reason- 
able order of magnitude for the characteristic length 4 introduced by the postulate 
of reciprocity (1.3) 


= 6,8۰10 15 cm, and a reasonable order of magnitude for the 


s 


self-masses : 
x (pion) = 273m,, x (nucleon) — 343m,, (5.15) 


larger by two orders of magnitude than the self-masses of electromagnetic origin. 
This agrees with the ratio of the coupling constants (f/e)?. Besides the interesting rela- 
tion (5.13) we notice another surprizing numerical relation in (5.15): these two values 
are nearly equal to 2.137 and 2!/,. 137. 

(d). The question of relation with the theory of Pais. We notice a striking simi- 
larity with the formalism proposed recently by Pais (1953). In both formalisms the 
mass operator is related to the group of rotations in a three-dimensional space. How- 
ever, Pais assumes a twodimensional sphere imbedded in an abstract space and inter- 
prets this as the space of isotopic spin, while in our formalism it is a sphere in the 
usual world. The variables r, transform simultaneously with the x,s and, consequently, 
our mass eigenvalues are intimately connected with the ordinary spin. Thus, we see 
no connection with the isotopic spin whatsoever, so that we have to withdraw the 
tentative supposition made in (I) of a possible connection of our formalism (in partic- 
ular of our matrices f,) with the isotopic spin. 

If, on the other hand, we abandon the connection of the variables r, with the 
usual space and let them transform quite independently of the variables x,, then we 
may interpret the additional spin as the isotopic spin. However, in such a e there 
is no reason for assuming reciprocal invariance. Several kinds of constraints other 
than the reciprocal ones might be thought of, or even other kinds of underlying space 
(instead of the four-dimensional pseudoeuclidean space of the variables r,) might be 
assumed in this case (Pais 1954). 


Note added in proof: 
A similar bilocal generalization of the Dirac equation yields a family of particles with half- 


-integer spins j and a mass spectrum of the following type 


C 1 Ps 
questus WA ERDE I 
М.а] / tz: i t ^w 


where | is a fundamental length and C is a dimensionless constant. The same formula applies also 
Ar ы eee In this case the spin is j = 1/,. 
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———— — mu 


On the other hand, as remarked by F. Cap, it is also possible to describe particles with half- 
-integer spin by two-spinors satisfying a second order (Schródinger Gordon) equation. In this case we have 


GE 1 1 1 3 
D EDS Bee aes , ا‎ II 
Mose ( ~) (++) И (Ш) 


Adjusting the value C/I trom (I) to yield for j=1/2 the proton mass 1836, we get Ms), =2597 
close to the value of the mass of the cascade hyperon Y. With the same C/l we get from (П) 
My, = О, Ms), = 2183, My, = 2860.... Thus, formula (II) includes the lightest fermions (e, v, u,) the 
hyperon A° and the heaviest hyperon (recently discovered by Fry et al.) decaying into a neutron 
and a K-meson. 

In agreement with the general postulate that to every irreducible representation of the group 
of rotations and reflections there should correspond (at least) one type of particles, we may expect 
the existence of two boson families described by a scalar bilocal field quantity (satisfying a first 
order or a second order wave equation) and two families described by a ps-scalar bilocal field 
quantity, i. e. two tensor and two ps-tensor families altogether. Moreover, we may expect six 
families of fermions (described Бу the first and second order wave equation) with relative pari- 
ties (+), (F), ( 2), (F i). To prevent the decay of nucleons and hyperons into light fermions, we 
assume that the former possess the (-L)-type parity while the latter possess the (+ i)-type parity. 


КРАТКОЕ СОДЕРЖАНИЕ 


Райский, О билокальной теории семейств элементарных частиц 


Обобщенные билокальные волновые уравнения, ограниченные взаимностно 
инвариантными добавочными условиями описывают семейства частиц с различ- 
ными значениями спина и массы. В противоположность к предшествующему 
предположению нет непосредственной связи с теорией изотопического спина. 
Доказана тесная связь с теорией высших спинов Фирца. Кратко обсужден вопрос 
об интеракции. 
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A differential description of non-local systems is obtained by replacing the original 
integro-differential equations containing integrals over space-time by equivalent integro- 
differential equations containing integrals over a space-like hypersurface only. These 
equations provide a descriptions of the system which is local in time. The construction 
of the coefficients of the equation in question is carried out in the linear as well as in the 
general non-linear case (Sections 1, 2, 4). The calculations are illustrated on a particularly 
simple example (Section 3). The admissibility of form-factors containing poles in the 
Fourier representation (corresponding to multi-mass equations) is discussed. 


Introduction 


In the first paper of this series (Rzewuski 1954) we have obtained a differential 
description of non-local theories in the case of linear problems with one independent 
variable. Such problems correspond to mechanics and quantum mechanics, respective- 
ly and are, so far, of little practical importance, since there is no need for the consi- 
deration of non-local mechanical systems. However, the mathematical methods devel- 
oped in this simple case may now be applied to practically important field-theoretical 
problems. : 

In mechanics a non-local system is described by functions of one independent 
variable (the time) which satisfy ordinary integrodifferential equations. We have 
obtained the differential description of such systems by replacing these equations by 
` the equivalent purely differential equations of the same order in the derivatives. In 
the case of non-local field theories the original equations are partial integro-diffe- 
rential equations the unknowns being functions of points of space-time and we shall 
try to replace these equations by some other, equivalent, equations giving a differen- 
tial description of the system. It shall prove that, contrary to the case of ordinary inte- 
gro-differential equations, it is in general impossible in the case of partial integro-diffe- 
rential equations to construct equivalent purely differential equations. However, we shall 
be able to construct such integro-differential equations in which the unknown func- 
tions occur at one instant of time or at one space-like surface с only. These equations 
provide a description which although still non-local in the space variables is local 
in time. ‘ 

(121) 
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l. The linear equation (with a given external field) 


Let us consider first the simple equation 
(C —2*) g(a)= A f K (x, x’) dx'a(') (1.1) 


corresponding to equation (1.1) of Part I of this series. Here x is a point of space- 
time, dx a four dimmensional volume element and the integration domain is the 
region of space-time contained between two space-like surfaces c, and оз. Units are 
chosen.in which h — c — 1. Equation (1.1), together with its complex conjugate, 
describes a scalar charged field q, q* interacting non-locally with a given external field, 
the interaction being contained in the kernel К (x, x’). In the local limit 


К (x, x’) > Ф (x) 6 (x — x), (1.2) 


where Ф(х) is the given external field, and (1.1) goes over into the conventional local 
equation (A = interaction constant). 

Equation (1.1) is non-local in space and in time the right hand side containing an 
integral of q(x) multiplied with a weight factor К (x, х) and integrated over the 
space-time region (01,0). We shall prove the following 

Theorem: Equation (1.1) is equivalent to the equation 


(O — 4 д (0) =—A J Ala] ( x) Xia) do,’ (1.3) 


in the sense that each solution of (1.1) is a solution of (1.3) and vice versa. 

In (1.3) X, is a differential operator defined by 

ео, Oy: 

the arrows indicating the direction in which the differentiations are to be carried out. 
The integral is taken over an arbitrary space-like surface с, the quantities A[o] (x, x’) 
being functions of the points x and x’ and functionals of the surface с. Choosing т 
(1.3) for с a surface passing through x, we obtain an equation in which the unknown 
functions and their derivatives occur only at points of one hypersurface. Such an equation 
provides a description of the system which is local in time. 

Let us proceed now to the proof of the above theorem and to the construction 
of the quantities A[o] (x, x’). For this purpose we replace (1.1) by the equivalent 
integral equation 


406) = q0) +2 Јох) ах q(x), (14) 
where А 


N (2) = — [Сам dx!" K (x па), (15) 


от 
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G being an arbitrary particular solution of the equation 


(E1 — x?) С (х) = —6 (x) (1.6) 
and 9° being the general solution of the equation 
(O — x?) 9° (x) = 0 (1.7) 


If N (x, x) is integrable and Л sufficiently small, equation (1.4) possesses for 
each particular choice of g°(x) a unique solution. The general solution q°(x) of (1.7) 
contains two arbitrary functions of three variables. This is conveniently exhibited by 
expressing the solution of (1.7) by the initial values on an arbitrary hypersurface o, 
from the region (01,0) 


4°(х) = f ^( — x) X, дк) do, (1.8) 
where 4° (x’) and n,q, (х) are two arbitrary functions on 0%. A (x) is the well known 


particular solution of (1.7) satisfying the following initial conditions 


A (x) 20 for em Sat: (1.97) 
9A 5 
в, H = 1 for ø through x = 0 ( Or т EFA б (x) for x ») (1.9") 


where n, is the unit vector normal to с pointing towards the future. A possible repre- 
sentation of A(x) is the three dimensional Fourier integral 


xh „+ -зт з A x х, 
A(x) — EXE u (1.10) 


Since there is a one to one correspondence between the solutions of (1.7) and 
(1.1), the manifold of solutions of (1.1) contains also two arbitrary functions of three 


variables. 
To prove the equivalence of (1.1) and (1.3), we must show that in the space of 
solutions of equation (1.1) the following identity is satisfied 


f K(x, x’) 4х‘д (х) = — f А[о] (x, x’) X, g(x’) do,- (1.11) 


We shall carry out the proof and construct at once the quantities A[o] (x, x’) by expand- 
ing q(x) and A[ ] (x, x’) in powers of 2. Introducing the notation 


{АВ...С} (x, x’) = | P f Alx, EDdEB(E1, £9) dê*... di" C(P x), (1.12) 


о, 


we may write the solution of (1.4) in the form 


q(x) = q?(x) +4 ру ANS) (x, x’) dx’ д (x), (1.13) 
ў 4 
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where 
oo 


Y им бу) = RG x) (1.14) 


is the resolving kernel of equation (1.4). Introducing now (1.8) into (1,13) we obtain 


а(х) = [| у Я (NIA) (x) X; 4° (х')ао\, (1.15) 


th US 


The left hand side of equation (1.11) becomes now 


Е dx'q(x) = I 4 (КМА) (хх ) X, 4° (x) do, (1.16) 
i=0 


Бшш the right hand side of equation (1.11) and using (1.15), we get 


Ajo] (a,x) = 3 A A9] (x, x’), q (x) = У АА), (1.17) 


і = 0 


where 
g® (a) = f {NAY Gs a?) X, qt?) do, (1.18) 


and finally 
те ГА] (x, x ) X, а (x’) da, 


-—) NEU (x, х") Х, fers (x",x) X, q^ (x) de, de. (1.19) 


i=0k 
Equating coefficients in (1.16) and (1.19), one gets 
-Y| d 1] do’, A® [o] (x, ху XZ (NIA (^, x) X; ° (к) 


k=05 


> | KI») (x, x’) Худ (х) до, (i =0,1,2,...). (1.20) 
J 


This equation is an identity in the space of solutions of (1.1) and, therefore, in the 
functions 4°(х’) and n, "S (x') on су. In virtue of the tehorem of Du Bois Reymond 
— D f^ AQ [7] (x, x") X, (NUI) GU, x) = (KN'A) (x, х) (= 0,,...). 
=e (1.21) 
This is an infinite set of linear equations for the determination of the coefficients 
A® [o] (x, x’). Considerable simplification is achieved by remembering that o° and 


Pye 
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therefore the point x’ in (1.21) are quite arbitrary. If we choose x’ to be a point on ø, 


which is equivalent with putting со equal to c, equations (1.21) become, due to the 
properties (1.9) of the A-function, 


A9 [o(x')] (x, x’) = — GON'A) (x, х) 
i—1 
-- у fe A [o (x' )] (x, x' ) X, ЛО) (1.22) 
k=0%., 
olx’) 
The notation c (х) indicates that we have chosen the point x' to lie on c. Equations 
(1.22) are recurrent formulae for the coefficients AP [0(х)] (x, х). 


The convergence of the procedure follows easily from the assumed convergence 
of (1.15), which completes the proof of the theorem. 


2. Properties of the quantity A [o(x)](x, х) 


It is necessary for consistency that the right hand side of the new equation (1.3) 
is independent of o in the space of its solutions. That this is indeed the case follows 
from a certain property of the quantity Afa(x’)| (x, х). 

Consider the three first approximations 


Ao(x')] (x, x) = — {KA} (x, x); 


AG[o(x)] x) = — (КМА) (x, =) — do, 49 [o( 6o x”) Xp (NA) G^, x°). 


olx’) а 
до] (х, =) = — (КМВА) (x, x) — [4 AOL) (ma) Х„ QUA). (7,0) 
a(x’) 
— f as; 49 LN (2) X; (NA) (е, я). (2.1) 
a(x’) 
One easily shows that 
8AO[o(x^) I(x, x") _ 0 
óc(x'^) fen 
6AM 4 > i „ " / Fi ven 
NEN — mpeg А". qua 
SAD] x) _ 1 [ох (а x) AM [o()] (^s =) 
бо(х'”) 


+ Ао (хе, x") AM [о] œ) 650), 
where 6/60(x’’) is the functional derivative with respect to o at the point x". For the n-th 
approximation one obtains by induction | 
а Je х) _ 4-0 jo(a а =") AD lo) Gn) +... + 
o(x 
#АФ®[о(х'”)] (x, x") А-®[о(®')](®''› 5). (2.3) 
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From (2.2), (2.3) and (1.17) there follows 


дА [o(x')] (x. х) 
до(х'') 
With help of formula (2.4) we may easily calculate the functional derivative with 
respect to o (x) of the right hand side of equation (1.3) 


д ERY о озса x. M 
So) . Ale] (x, х) X, q (<) do, аи» X, q(x')do, 


= AA [o(x")] (x, х") A[o(x^)] (x^, x’). (2.4) 


с 
+ Alof) (x, 2°) (D — Б) (^), (2.5) 
where the arrows again indicate the direction in which the differentiations аге to be 
carried out. Now from the recurrent formulae (1.22) one may easily see that due to 
the occurrence of A functions at the end of each chain {...} the quantity A [o(x’’)] (x, Xx) 
satisfies the equation 


(07 — x?) Afo(x’’)] (x, x") = 0. (2.6) 
The right hand side of (2.5) becomes, therefore, 
A|e(x")] (5^) [A [Ale] (^, x')X,q (4) da, + (Г1'' — х®)а('?)] =0 (2.7) 


on account of (1.3). The functional derivative of the right hand side of equation (1.3) 
vanishes, which is necessary for its internal consistency. 

This result shows that we may move arbitrarily the surface c in the region between 
2, and сз. In particular we may take с to pass through that point x which occurs as 
argument of the left hand side of equation (1.3). In this case we get an integro-differ- 
ential equation in which the unknown function q(x) and its derivatives occur at points 
of one space-like surface o only. Such an equation has local character with respect 
to directions normal to c, although it is still non-local with respect to directions tangent 
to о, due to the occurance of surface integrals. Thus if we introduce a continuous 
parameter т numerating the surfaces с from o, to оз, then equation (1.3) provides 
us with a description of the behaviour of the field q(x) which is purely differential in v. 
This is easily seen if one integrates equation (1.3) by means of an arbitrary particular 
solution G (x) of equation (1.6) 


4б) = 4°@) + 2 f СА), x’) X, q (v?) do’, (2.8) 


Here the surface integral is again independent of с due to the property (2.4), so that 
we may by means of (2.8) express the solution q at the point x by g? at the same point 
and by the values of q and its normal derivative 9» = — n,q, at an arbitrary space-like 
surface с. 

Another question we shall investigate now is the behaviour of A[o(x^)] (x, x") 
in the local limit (1.2). In this case the right hand side of equation (1.3) must go over 
into the local expression | 


A D(x) q(x). (2.9) 
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Consider the first two approximations of Alo(x’)] (a,x) (cf. (2.1)) 


As (x^)] (x, x’) > — D(x) A (x — x’) (2.10) 
AO [o (x^)] (x,x^) > — Ф (x) {NA} (x, x^) 4- [45:26 A (x —x") > {МА у(х", x’). 
olx’) 
It is easily seen that for о (x’) through x the distance Xy — х, is space-like and, due 


to the properties (1.9) of the A function, A™[o(x’)](«, x je 0 together with its first 
derivative with respect to x,. The same is easily shown by induction for the next 
approximations, so that equation (1.3) becomes in the local limit 


(D — x2) g(x) = A Hi  (x)A (x —x')X,q (x) do, = AG(x) q(x). (2.11) 
a(x) 


3. Example 


It is instructive to illustrate the developments of Sections 1 and 2 on a particular 
example. Let us consider equation (1.1) with 


A (x, x1) = mL (x — х), (3.1) 
where 
== I еихи 
An(x) = лу р {ле dk. (3.2) 


This type of form-factor describes the rather trivial case of a constant external field. 
However, its study is of some interest since in the case (3.1) the calculation of the 
quantity A[o(x’)] (x, x’) is especially simple and the character of the general procedure 
is exhibited particularly clearly. On the other hand example (3.1) throws some light 
on the problem of multi-mass equations. 

We notice first that A,, satisfies the inhomogeneous equation 


(D — m3) A, = — 909. (3.3) _ 


Multiplying equation (1.1) on the left with the operator (Г] — m?), we get, therefore, 
if the kernel is defined by (3.1), 


(п — т”) (0—3) 4 (0) = — Ат? (a). | (3.4) 


This fourth order equation is by no means equivalent to (1.1) since it possesses twice 


| as much solutions. Apart from this fact it has the disadvantage of an indefinite energy 


(cf. e.g. Pais and Uhlenbeck 1950). Equation (3.4) may be Ша diately solved by 
Fourier transformation: 


q(x) = q(k) e^. 
(k? + m?) (kj +x?) = — Ат? (3.5) 


y 
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т? +x? т? — и? 4 Am? 
Ku ao С» ren (3.6) 


We have thus four independent solutions corresponding to the four values of ko 
given by (3.6). In each solution q(k) is an arbitrary function of the remaining three 
variables Ё; (i = 1, 2, 3). 

Let us now apply the methods of Sections 1 and 2 to equation (1.1) with the 
particular choice (3.1) of the form-factor. To make explicit calculations by means of 
Fourier transforms possible we shall assume that the surfaces o1 and c, move towards 
— оо and + оо respectively, so that the integration domain is the whole of space- 
time. We must also make a particular choice for the solution of equation (1.6). It is 


.convenient to take 
С (x) = A), (3.7) 


A „(х) being defined by a formula analogous to (3.2) with х in place of m. It should be 
noted that the particular choice for G has no influence upon the results. 
For the first three approximations of A[o(x )] (x,x') we get after some calculations 


— m? 
(m? х?) 


AO(x, x) = А59), 


т^ т^“ (2) 


A(x, x’) = m Aa — x) + mE AVG =x); (3.8) 


2m$ 


m$ 
Аа) = Ga Aa) + oe 06—90) + 88) 


m$ ; 
аар 
where Л, (х) is defined by (1.10) and A(x) ЛО (л) and in general A (x) are defined 
by the contour integral 
eikux, 


il 
BPO mima qum dies 
| (0 = ел) (E лу Р (3.9) 
where the integration limits for the real variables k4, ka, Ёз are — оо and + ee whereas 
the integration over k, is to be carried out over a closed path in the complex ky — 
plane surrounding (in the positive sense) both poles of the integrand. It may be easily 


verified that with the above definition (3.9) 
AD (а) = A(x) (3.10) 


For the derivation of (3.8) the following property of the AM (x) — function was of 
importance 


(A, Ay (05, 2°) — f Au a") X, (A, AD) (^, x) de, = ДН (x —x’). (3.11) 
a(x’) 
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It may be shown further that the (п — 1) st approximation is given by the formula 


A@-2 (x, x’) EE n 1 lS 2n— 3 
2 21-242 ҢЫ 2 
1 4m? ry ke Р b 
Sat (т? — x3)? унд — x) + В (х—х), (3.12) 
where R(x — x’) is a linear combination of the functions A(x — x), A® (x — х), 
Adis A) (x — x"). The function AM (x — x') has the important property of vanish- 


ing together with all derivatives up to the order 2n — 2 (including) if the difference 
x — х, is a space-like vector. On the right hand side of (1.3) there occur only first 
order derivatives of A[o](x, x’). Choosing, therefore, o (x ) through x (which is equiv- 
alent with taking x, — x. space-like) we may neglect in (3.8) and (3.12) all terms contain- 
ing AM (x — ^) with n > 2 since they do not contribute to the surface integral in 
(1.3). The coefficients of A, (x — x’) in the successive approximations A form 
a series which, for sufficiently small 4, may be summed up to give 


| т? ur x 4 Am? i 4 Am? 
ay e Ni (mei) 


Introducing (3.13) into (1.3), we get finally (3.13 


(п — x?) q (х) ze |. etl е а (x). (3.14) 


The local equation (1.3) equivalent to (1.1) is in the case of. the particular kernel 
(3.1) a pure differential equation. We should like to emphasize, heowever, that this 
result is rather exceptional. 

(3.14), like (3.4), may be solved by a Fourier transformation g(x) = q(k) ee. 
For k, one gets the equation 


N ee уп... IMPR 3.15 

ky = Smee Tes 3 2 | (m3 — 2)? (5-15) 

Comparison of (3.15) with (3.6) shows that only two of the solutions of equation (3.4) 
corresponding to the positive sign in (3.6) are also solutions of (1.1) or the correspond- 
ing local equation (3.14). It may be noted that the energy corresponding to (3.14) is 
definite (we are speaking, of course, only about this part of the energy which does not 


contain the constant A). 


4. Non-linear equations 


We may apply the methods of sections 1 and 2 also to nonlinear equations. Let 
us take the case of a charged scalar field ү, y* interacting non-locally with a neutral 
scalar field ф. The integro-differential field equations are e.g. 


(оро) = л [PU g O $ Ada. 
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the complex conjugate equation, and (4.1) 


O 


(D—m3 o (1) = A f 24-2) &* (2)  Q) das. 


01 


where 


70 = f dr v O P* (2,1), 
$i (4.2) 


с 


#0) = f drp © FQ. 


Cı 


The arguments 1, 2, 3, ... denote points s v Xa ... in space-time. The form-factors 


P and Fare functions of a and Е by means of the invariant (22 — x2)? in order to secure 


translational and rotational invariance. 

The problem is now to find such integro-differential expressions in which the 
unknown functions would occur on one space-like surface only and which would be 
identical with the right hand sides of (4.1) in the space of solutions of (4.1). It proves 
that the following set of equations 


(O —) (1) = 4 [| [| А, (3) X? X? p (2) y (3) doy do; 


2 
+ 2 | | |; ЁЁ (з) X? X? xiy* (2) р (3) y (4) 
4, 


2 
vh (13) xixiXio (2) o Qv Ф| do? do? dot 
4 


2 
3 ) | 
+” ||) Азу [14] X420 Хо Хоу" (2) e B) vA) v 5) 
"Е 5 


2 
3 

+ Аз» [14| Хх; Xo Xs e (2) v (3) p (4) v (5) do? do? 40% dog +... (4.3) 
5 


the complex conjugate of this equation, and 


(D —m2)g(1) = А Ji if Bi (3) X? X? y* (2) v (3) do? do; 


2 
ق‎ ү ү | B, (13) X? X? X4y*(2) р (3) y (4) dof do; dog +... (4-4) 


is equivalent to (4.1) in the sense that each solution of (4.1) is a solution of (4.3,4) 
and vice versa. 
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Sr gerne | 


C1 
= 


The calculation of the cofficients Ay, 4,1, 4, ... By, В», ... may be carried out 


by expansion of y, y* and ¢ in powers of A. In contrast to the linear case Av А 
and Ву, B,,... are not expanded. After some calculations one gets 


2 
nel 


Л, 


A (13) Pr (Pl р ( s) 
an = P* (— x? — и?) T3 
4 28 4 
— |р(— м) | PU 2 fe (5 ies | 
о A 


2 2 
Ass (1 3) = P*( —x?) Е? (— m?) as (: 3) 
4, 4 


— F(—m?) P* (—›) | a isle] 


o 


s (3) ие) 


The coefficients Ay 4p... By --- аге, of course, also functionals of о. We do not 
write it out explicitly not to complicate the notation, In contrast to the linear case it 
is impossible to give closed recurrent formulae. In principle, however, the calcula- 
tions may be carried out to any desired order of approximation. 

"Formulae (4.3), (4.4) and especially (4.5) need some explanations. The numbers 
1, 2, 3, ... denote functional dependence on the points A yx a$,... The symbol {...} 


has a meaning closely analogous to Пе Е 


A Os о 
$ 2 
T o ;) = Ја. КОЕ A, (5,4) L (5,6) Ax (6,3) A, (6,2). (4.6) 
VI. с, O1 
Е(—т?) and P(—«?) are the Fourier transforms of F(x) and P (x) taken at 2 = — т? 
and 2 = — x? respectively. The functions A, (x) and А„() are defined by (1.10), 
the second one with т in place of #. Finally Г, К, G,, С, are given by 
Бане} AE К) (P** G, P) (1,2); (4.7) 
С, (1,2) = (GF) (12); G, (1.2) = (С, P} (2); (4.7) 
where 
р" (1,2) = F 2,1); Р" (1,2) =P 2); (4.8) 


and G,, and G, are arbitrary particular solutions of equation (1.6), the first one with m 


in place of к.“ у 


a 
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It may be shown, similarly as for the linear case in Section 2, that the right hand 
sides of equations (4.3) and (4.4) are independent of c. This is due to the following 
properties of the coefficients Ау»... „Баз 


ô 2 д о 
zig ^ 3) = aay? s) =° 
2 
ó 5 2 
3:8) AS (: ;) od \\ i) B, (5 3) (4.9) 
8 2 2 = 
306) 4,9 1 à =—A, 1 5 A, о 4 etc. 


In the local limit 


P(x) > 6(x), F(x) > б(х), 
P(— x?) — 1, F(— m?) — 1, (4.10) 


Th (1 3) > (1,2) 4, (1,3); B, o 3) — A, (1,2) A, (1,3) 


all other coefficients tend to zero if o goes through the point 2€ so that equations 
(4.3), (4.4) go over into the local equations 


(0 — =?) y = 19y; (D— m?) 9 = Av" y. (4.11) 


The above considerations are, of course, applicable also to more general types of 
form-factors like those of the Kristensen Meller (1952) theory. In this case the right 
hand sides of (4.1) are of the form . 


af f Ф (1,2,3) g (2), v (3) dx, dx, etc. (4.12) 


Equations (4.3) and (4.4) provide us with a description of the dynamical system 
which is local in the directions normal to ø. In particular one may express, similarly 
as in the linear case (2.8), the solutions y and ф at an arbitrary point x of the domain 
(01, 02) by the initial values of y, p and their normal derivatives on an arbitrary space- 
like surface с (in general not passing through x) from the domain (01,02). Further- 
more one easily sees that these initial values may be given quite arbitrarily. Indeed, 
given a particular oy, one may always choose С, and G, in such a way as to make р 
and ф coincide on o, with y? and 9? together with the first derivatives, where y? and 
99 are solutions of the unperturbed equations (A = 0) corresponding to (4.1) or 
(4.3, 4). These functions, however, are quite arbitrary on o, as is immediately seen 
from the analogues of equation (1.8) for 9 and g°. 

All these results hold, of course, in the case when the form-factors possess poles 
in the Fourier representation (cf. Section 3). This type of form-factors, interesting 
as a way of introducing families of elementary particles, is usually rejected (cf. 
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е. в. Bloch 1952 and Pais and Uhlenbeck 1950) in connection with higher order 
differential equations and the corresponding non-definiteness of energy. 

The above considerations show that the manifold of solutions of an integro-differ- 
ential equation of second order is the same as the manifold of solutions of a pure 
differential equation of the same order (provided Ў is sufficiently small. This restric- 
tion may be replaced by a less stringent one that Ў is not a characteristic value of 
a certain spectrum, of. Krzywicki and coll. 1954). Thus there is no danger of intro- 
ducing superfluous solutions giving rise to an indefinite energy. E. g. the system 
(4.1) has a manifold of soluticns containing six arbitrary real functions of three 
variables and the corresponding energy is definite. 

The solutions of (4.1) satisfy a system of local equations of higher order in the 
derivatives 

K3(—D) $9 = 210, L'C-0)9 = AD, (4.13) 
where K-! and L-! are the reciprocal Fourier transforms of the functions K (x) and 


L(x) defined by (4.7). If Р and F have Fourier transforms of the type 
^ 2\1 = 2\1 
u ее. 
n( + =) от zn ( + E) resp., (4.14) 


then (4.13) are equations of finite order — 2 in the derivatives and possess, there- 
fore, a manifold of solutions extending the manifold of solutions of (4.1) (cf. also 
Section 3). The energy corresponding to the system (4.13) is indefinite. 

It may be of some interest, in view of the possibility to describe all particles of 
one family by means of one set of equations (cf. Yukawa 1953), to consider systems 


of equations of the type (4.1), e. £» 
M y: 

(n—4 воа) = Y 24 | POLA 2 oe as (û = 2...) 628) 
k=1 = 

the complex conjugate equation, anu 


N E. 
(Om?) 92а) = У An i p (1,2) $* (2) (2) dra (i = 12, ... M) 
pat с i 


with 
фе (1) = f daa v^ @ Р®* 21) ( =1,...) 


$9 (1) = fax ф® (2) FO (2,1) (Е = 1,... М). (4.16) 

с; 
If the form-factors P (1,2) and F (1,2) are of the type (4.13), then the solutions 
of (4.14) satisfy higher order multi-mass equations (cf. 4.13). If instead of (4.15) we 
` take the corresponding equations with the general form-factor (4.12), we have equa- 
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tions of the type proposed by Yukawa (1953) in connection with a theory of the mass 
spectrum of elementary particles. A system of the type (4.15) possess a definite 
energy and describes two families of fields corresponding to particles with masses 
жү... An; My... ти interacting non-locally with each other. 

It may be noted that the local system (4.3,4) is essentially non-linear in contra- 
distinction to the original non-local system which is linear in the fields y, y* and 
linear but inhomogeneous in the field ф. By essential non-linearity we mean that 
in the equation for y, y itself occurs non-linearly and similarly in the equation for g. 
The above fact shows an interesting correspondence between non-linear and non- 
local systems. 

The differential quantization of equations of the type (4.11) with form-factors 
of the type (4.14) was carried out by the author (Rzewuski 1953) by means of an 
indirect method (through the intermediary of higher order equations). We hope to 
give in a subsequent paper a direct method of differential quantization for equations 
of the general type (4.1) or (4.3, 4) without the restriction to the particular type (5.14) 
of the form-factors. 


КРАТКОЕ СОДЕРЖАНИЕ 


Ржевуский, Дифференциальная структура нелокальных теорий, II. 


Показано на примере одномерного линейного интегро-дифференциального 
уравнения как получить дифференциальное описание нелокальных систем. 
В $ 2 доказана эквивалентность интегро-дифференциальных и чисто дифферен- 
циальных уравнений того самого порядка при некоторых предположениях о ядре. 
В $ 3 и 5 проведена дискуссия коэффициентов дифференциального уравнения. 
$ 4 посвящен каноническому квантованию системы. Наконец, в § 6, дан другой, 
непосредственный подход к рассматриваемому вопросу. 
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DISINTEGRATION OF THE Ве NUCLEUS IN COULOMB FIELD 


By Jerzy SAWICKI 
Institute of Theoretical Physics, University of Warsaw, Warsaw 
(Received June 7, 1954) 


The cross-section for the disintegration (Be? + Be* + n) of high velocity Be?-nuclei 
by heavy nuclei (ZS>1) and the disintegration of Be?-nuclei by high velocity protons 
is calculated by Dancoff's method. Attention is centered on those processes in which 
there is no direct nuclear collision and where the disintegration is due to electric forces. 
The cross section с for 170 MeV Be?-nuclei is of the order 42 x 10-29Z? cm? and 
3.5x 10-2972 cm? for 265 MeV Be? -nuclei. For 18,8 MeV protons с = 4,75. 10-29 cm? 
and for 29.14 MeV protons с = 4.1 x 10-2 cm?. Quantitatively this process seems usually 
to be less important than the disintegration involving a direct nuclear collision. 


Dancoff (1947) investigated the cross section for the disintegration of high veloc- 
ity deuterons by heavy nuclei. He considered only disintegrations due to the Cou- 
lomb field (excluding the processes coming from direct nuclear collisions), mainly 
in order to compare the cross section with that for stripping processes. Serber (1947) 
found that the estimates of Dancoff indicate that about one quarter as many neutrons 
would be produced by the Coulomb field disintegrations as by pure stripping. 
Dancoff found o = 2 - 10-297? cm? for 200 MeV deuterons. It seems interesting to 
investigate the same process for Be®. We treat Be?-nucleus according to the nuclear 
shell model composed of two bodies: Be? + n, the neutron being bound with the 
energy = = 1.63 MeV ша Py, state.! We calculate the cross section in the frame in 
which the Be? -nucleus is initially at rest. The heavy nucleus (Z> 1) or the proton 
(2 = 1) is taken to be incident along the x axis with the velocity v. If v is sufficiently 
large (the incident Be? -nucleus energy i$ == 102%MeV in the laboratory frame in the 
first case and the incident proton energy == 10 MeV? in the second case), the electro- 


static potential energy of Be? is: ! 
47е? 


y = : (1) 
V Zn + (ri, — vt)? 


1 This model is very crude for nuclear processes involving high energy interactions with 
the colliding particle, e.g. the photodisintegration for a y-quantum energy >20MeV, but in the 
present case we are dealing with a small Coulomb interaction of a radius >> Re (radius of the 
Coulomb barrier). Besides, this model fits to the description of the distortion n-Be? of the Be?- 


nucleus in an external electrical field. 

2 Though the mass ratio of a proton and Be? = !J, is comparatively large, the Coulomb inter- 
action here is Z times smaller. Consequently for the energies involved (> 10 MeV) the deformation of the 
proton path due to the Coulomb field is negligible. Besides, the resulting isotropy of the cross-sec- 
tion predicts the déformation of the proton path to be even less important than in the case of the deuteron 
(Serber 1947). 

(155) 
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perma e Sr кесш с AA 


where 74, and r,, are the components of the Be? coordinate along the x axis and per- 
pendicularly to it?. For energies involved here the Born approximation should give 
ral 
reasonably good results except for the heaviest elements. We have r, = R + r, 
У 5 1 9 
where R is the radius vector of the Ве? center of gravity and г is the vector from neu- 
tron to Ве8. The wave functions of the initial (a) and final (b) states are 


0; 


|| 


жөн ЧЕ М 
p, = Lhe Py (г) ef Ton 
E 


b 


V, = Ге) eat; m k. 


|| 


E,, are the total energies of the respective states; E, = Eg, — €, E, = Eg, + E, 


a 
(Egand E, are respectively the center of mass and the relative energy of Be?); L is 


the side of а cube and v, (г) are the wave functions of the internal motion of Be?, 


рь (г) being the continuous spectrum eigen-function normalized in the energy scale. 

In our frame of reference Ep, =0 and E,— E, = ho = E, + £ + Er, where 
h2k2 hi}? 

Е, = = Rb = 7003 
2u 18m 

mass, k,—the relative momentum vector. For the ground state Ру, we take the 

Be? — п interaction in the form of a spherical well: — V, = 12:16 MeV with a ra- 


dius rq = 5.10713 cm. According to the nuclear shell model, we have 


8 
m being the nucleon and u = 5 mthe neutron reduced 


Ч 1 yy Ка I, j 1 7 
vas (ti I ТЕГО) ui (rer +m) Уи (0, Ф) a (о) 


V 
+ (+3 —m) уу» (6,9) 86) - о 


а (0), B(^) are the spin functions. For the Ps), (a) state 


- giin маиы 
Р, = {Pa (т | о)), m;o—3.—3 33: 

and 

| Jem 

Ra (r) = AV з. (Br) ~ r ro 

(3) 

Ra (r) = Bi GE e «Xr -r) => 

where 


A 
TUM 
B, = sin Вто 


8 oe . | * . + * 

The disintegration due to magnetic interaction with the neutron magnetic moment is not consid- 

ered here, as the cross section found by Guth and Mullin for magnetic photodisintegration is much 
smaller than that for electric photodisintegration. 
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Air, | * 4 2 M 
T js | E + аго) (2) = (1 ото) (2) =1| Eur EIE 


The differential cross section is (Dancoff 1947): 


L\ - 
MIS. ТЗ. 
da= IMEET? (5) f(Q,) dE, dQ, d, Е (4) 
+00 
—h- iw 2 4Ze? 
Moha fae em = hit = 5 (5) 


where, because of аат = dr, dr and R = p T 


я 
236 (6) 


o. 
and 
оа, у 
I = [drei s "v0 Ya 09: (1) 
f (9,) is the angular distribution factor of the final state. It is easy to show that simi- 
Jarly as for the deuteron (Dancoff 1947) in the important range of the variables kr/9 
lies well under 0,1.3 Consequently we have 
E . kr 
e3y"cl—ig (8) 
and 
) une ik 
E | ағ vs O ра) = 5 <>. (Та) 


ermits P— S and Р-> D transitions. We shall 
S — state we take the function used 


hotoeffect of Be? 


The selection rule Al = + 1 p 
first calculate the more important da pg. For the 
by Guth and Mullin (1949) in their work on the p 


3 After Dancoff we have Ё or R, being equal to radius of Z-nucleus + radius of Be? where 


r does not exceed effectively the Be? radius. 
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Ra, ice r OX T 


д, = (E) * sin [k,(r -— + де] vee (9) 
Z To 


sch? 


where 


AA В 
A, sin уго = T Е i sin до, Ag cos yro— © Г) cos бу 


2р ү ie 
p= a (Vs + Е) and V, = 3 MeV. 


Then we have similarly to Guth and Mullin’s result: 


|I (P—-S)|?fes(2;) = |Rps(kr)|% (10) 


a i - 5 5 


where X denotes summation over the final and averaging over the initial states and 


И 
ды E a 
r (s + y*etg yr)” 


| 2 [92 — 82) вто) 
2 | осы 5 (a2 + kr)? 


RÊS) = Г Ry, Rardr 
0 


В 2 
242 (4) A) uro pan + Rd 
OF — py о + ie)? 


2) 2 
3 + arg + kr, (1 + 28] уго etg Yro 


(11) 


For the D state we have 


Riz (r) == И = 


ЕЗ rà 


~T Z To (12) 


[20% LT E 
^- Qu) (Zs 
3 (3 — #10) sin 0, — ЗЕ, rp cos д, - (5) sin 6, — k, rg cos д5 
(8 — y? r$) sin Yro — 3yr, cos угу YJ sin y rg — угу COS Уго 
With the help of Guth and Mullin's result, we have 


k? 1 (34 48 
Wahu Е n 2 | Еро (5... (3) 


— Ju (ут) ~ r Tg 


where 


|1, (Р Р)? - fep (О) 
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where: y = X (k, k) and if: О = 1 — yr, ctg yr, P = 1 — у|12: 


2 uk, V^ 
"p ё 3: ) 
Rpp (kr) za 


(2 ro (30Р + ke ro)? + (30Р + k? у ctg угу)?) ” 


{ а 2 Е (1 + ar) — arola? +) 2 (1 + aro) у? — a? ro (y? — = 
(ха + ke)? О 


аро | (a? + kê) (1 +aro + За) + 2а? 


(a? + #2)? 
(y? — В?) (1 + ary + @?/у®%) + = 
(yap) 
(1+ ar) — a? ro (a3 + 12) 
2-350 Е== === і (14) 


The E -distribution does not interest us and we first integrate over that variable. 
Eliminating after Dancoff by means of the cut off procedure the contributions from 
the direct nuclear collisions and taking kmax = Ro’, Ro being equal to the radius of 
the Z-nucleus + the radius of the Be®-nucleus, we obtain 


k? w 2 1. +оо Y cos6 
k,— — Omin 


ee j 
fast ee sine | 29202 = La? In Г; (15) 


v 
where ® is the polar angle of with the x axis as polar axis and! 


x hv 
Г = (cos лый = (e+ E) Ro 


because the contributions to the integral over Е in the limit L> co are restricted 


to the point k, = k cos 9 = D . Similarly 


, о\ L : 
ifs ota sin? [(«—3) 4 A 3 cos? Ө mer: 
(#3) 


+ sin? 0, In Г | : (16) 
Е MIU a Re 
212 k? k? 


k? Е CIN 
1 Here we have: Aw = £ + E, aan z e+ E, The Jast term TE which is the energy 


in the center of gravity motion is very small, 
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where 0, = X (k,, Ox). Finally we obtain 


; ; , 41832 34 
ас = dops + dopp = д e 729 | (Res (k,) |? ex 25 [Rep tar) In T 
p |Врь (x) |? (8 (3 cos? 6, —1) (I — Г-2) + sin? 0, In г) ex; 40, (17) 


For the incident protons our frame of reference is the laboratory frame. The angular 
distribution obtained here is, in contradistinction to that for the deuteron, nearly 


dG 5-2 4729; Ст? 
"Ty 10 [т 


йел 
i 


05 1 2.. 25 Er(MeV) 


d 
Fig. 1. - as a function of E, for P—S and Р D transitions. Incident Be? energy = 170 MeV? 


re 


аб | 7-2, „12, cm? 
— 1210 (Mey 


‚ dlp 


05 1 2 3 Е (М 


ас 
Fig. 2. ЗЕ as a function of E, for P—S and P—D transitions. Incident proton energy = 29,14 MeV 
Le 


isotropic by virtue of the predominant role of the isotropic dap (see Fig. 1. and 2.) 
This fact awakes the hope to distinguish in part between our process and the disinte- 
gration due to the direct nuclear collisions with anisotropic angular distribution. 
For instance, the angular distribution of deuterons for the Be? (p, d) Be? reaction was 
experimentally found by Cohen (1953) to havea very pronounced forward peak (cal- 
culations for this reaction are being prepared). However, from (17) we observe a small 
forward scattering. On integrating do’ over @„ we have 


, & 028a - А 
do = 2 Be T9 (|Rps (®,) [8 + 2|Крь (k,)|?) In T dE, (18) 


Е,) К h 
1 Г-1 = cos бы = 0 To <1; nonce (E) = 5 — e 
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do 
JE has a sharp maximum (see Fig 1. and 2.) near E, = 0.15—0.25 Me V due to do ps. 


The integration over E, was carried out graphically. On taking Ry == 10 !?cm (o is 
very insensitive to changes of Ro), we find for 170 MeV Be?-nuclei 
nodus 2.095 10:55 «72 em", Gasp 1.69. 10229 = 72 0m* 
С = Ops + Орр = 4.24 · 10-29 - Z?cm?; 
and for 265 MeV 
арх = 2.08 OL Cm / 955 = 1.431029 Zicm? 
G = оре Гор = 3.51: 10-29. 2 ста. 
For incident 18,8 MeV protons 
Ops = 2.94 · 102% cm"; Орр = 1.81: 10-? cm* O = у: 10-29 cm? 
and for 29,14 MeV protons 
Ope = 202 107% em", dap = 1.6 · 19-29 cm?, о = 4.1 · 10-22 спа? 


I would like to express my best thanks to Professor Infeld for his kind interest in this 
work and to cand. J. Dabrowski for many helpful discussions. 


КРАТКОЕ СОДЕРЖАНИЕ 


Савицкий, Дезинтарация ядра Ве? в кулоновом поле 


Вычислено методом Данкова эффективное сечение дезинтеграции (Ве —>Ве +n) 
быстрого ядра Ве? тяжелым ядром (Z>1) и дезинтеграции ядра Be? быстрыми 
протонами. В этой работе внимание сосредоточено на явлениях, в которых нет 
непосредственного ядерного столкновения и дезинтеграпия обусловлена электри- 
ческими силами. Эффективное сечение g для ядра Be? с энергией 170 MeV имеет 
порядок величины 42. 10-2972 см? и 3,5. 10-2972 см? для ядра Be? с энергией 
265 MeV. Для протонов с энергией 18,8 MeV в = 4,75 . 10-29 см2, а с энергией 29,14 
MeV g = 4,1. 10-29 см?. Количественно этот процесс, повидимому, имеет меньшее 
значение чем дезинтеграция обусловленная непосредственным ядерным столк- 
новевением. 
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ANGULAR DISTRIBUTION OF DEUTERONS FROM Be*(p,d)Be?.I*) 


J. DĄBROWSKI and J. SAWICKI 


Institute of Theoretical Physics, University of Warsaw, Warsaw 


(Received September 2, 1954) 


The pickup process Be? (p, d) Be? is investigated. The differential cross section is 
calculated with use of Born's approximation. Contrary to the simplifying assumptions used 
by Bhatia et al. the exact wave functions of beryllium are used in the calculations under 
the assumption of the nuclear shell model of the Be? nucleus. А formula available for 
numerical calculations is obtained. 


Introduction For some years past the (d, p), (d, n) stripping process has been 
used succesfully in nuclear spectroscopy. There are two basic theoretical methods 
at our disposition: the one of Butler (1951) and Born's approximation method. Both 
methods were found by Gerjuoy (1953) to be fairly equivalent. However, in the up-to- 
date applications of Born's approximation Bhatia et al. (1952) as well as Daitch and 
French (1952) have made some simplifying assumptions which are difficult to justify. 

Recently the reversed pickup process was investigated by means of the reciproc- 
ity theorem. In the present paper the theory of the Be? (р, d) Be? process is devel- 
oped by means of Born's approximation. The exact wave functions of Be? are applied 
under the assumption of the nuclear shell (two body) model for the Be? nucleus. 
Thus we avoid the simplifying assumptions mentioned above. The aim of the pre- 
sent paper is to explain the experimental data of Cohen et al. (1953)! on the angular 
distribution of the Be? (p, d) Ве process and to verify the Be? — model used in the 
calculations. | 


Formula for the cross section. In the calculation of the scattering amplitude, 
we neglect the neutron — Be® interaction compared with the neutron — incident 
proton interaction. We do so because the n-p interaction is responsible for the process 
investigated and because in our case this interaction is stronger than the n- Be? one.? 


* The present paper is a more detailed development of the note of Dabrowski and Sawicki (1954). 
1 Recently analogical experiments are reported by Reynolds and Standing (1954). 
2 This question will be discussed more fully in П. 
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The p-Be® interaction is neglected and particularly the Coulomb effect is not taken 


into consideration. 
The differential cross section in the center-of-mass system is given under our 


assumptions by? 


do, 
dQ, 


= (12лї®)# MaM; (ы) (1/8) X |W}, Q) 
mtd 

W = [dritdzdc do yj (8) ул (00,09) © ба Va (00,7) у (80) Xo (o,) e^», (2) 

where M; — 0.9 M, M, — 1.6 M are > respectively the proton and deuteron reduced 


masses M being the nucleon mass; hk, Ak, are respectively the proton and deuteron 
momenta (in the c. m. system); у 2 Xi» Xp Xp are respectively the deuteron, Be?, 


Be? and proton internal functions; 0 is the radius-vector from o to proton, 


> 


r, — from Be® to neutron, rz = 255 +5 0 — from Be$ to deuteron, r, = E г, Но — 
from Be? to proton; op, 0, are tree the neutron and proton spin coordinates, 
Е are the internal Ве8 coordinates; т,» и» ма are respectively the Be?, proton and 
deuteron magnetic numbers, V4 is the deuteron potential. 

For Be? we use the two body (п — Bef) model. For the п — Be? interaction we 
take a spherical well of depth V, and radius го. This type of interaction was succes- 
fully used in the theory of electro-disintegration (Mamasachlisow (1943), Guth and 
Mullin (1949) and photo-disintegration of beryllium (e. g. Guth and Mullin (1949). 
Mamasachlisow found that for the neutron binding energy ¢ = 1.72MeV the 
values Г, = 11.96 Me V and ry = 5 · 10-18ст give the best agreement with the 
experimantal data. Guth & Mullin's and Überall's results are very similar. Under our 
assumption, we have 


x; (fra n) = Xy (8) Pm; (On Pn In) R (rp); (3) 
Om;R is the wave function for the Руз (ground) state given e. g. by Überall, where 


1 
Фи; (9,9,0,) = МУ (iim Hn 


“пт; 


123 
il > т) Ү 1! (9,95) Xen (On), (3a) 


where Хи„(о„) is the spin function of the neutron; 


К = түз Jaja (Brn) for rn Xo 
ix 
К = — NN, er ү Ну (ал) Тот те сл, 


% % 
wherea = (Be), - E и 5] usm, 


3 For spinless particles and rigid center of force one can obtain the analogue of Eq. (1) from the | 
equations given by Clementel (1954, Eq. (18)). 


(3b) 
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№ = demie (bro)? + S? 
Toro (no A) 3(1 + аго) + (arg)? 
№ = — теё En)? + 5°] -% 


% 
with S = no (1 + ar) +1, m = (“= e 


1 GUN US 
( 5 Mln at а m) are the Clebsch-Gordan coefficients. 


For the deuteron potential V; we take a well of depth U, and radius a. Then x, 
is the S — ground state wave function for the above V}: 


Ха (0 On 0) E Sug (с, с) р (о), (4) 
А sin 9/0 for o <a, 
v(e) = Uo — ёа Ae-ae—a) for о >a, (4a) 
р Uo 
х M(U, — ~ _ УМ 
where A -V za = ва), UBER : ёа , and =4 is the deuteron 


binding energy. 
On inserting expressions (3) — (4) for y; and Xq into (2), changing the variable 
of integration (dr, = dr,), and noting that 


| : 
f da, doy Sg (0495) Xun (On) Xup (с) = (5 : $ Un Up | = E 1 м) , (5) 


we find 
ay at Wak 1 о 
UNE (н shin) К? | 122 т), 
Ham] 
‚ | беро) Valo) f dz, Rlrnje n YEU Onpa), (6) 
where 
> || > > > > 8 > 
К = = И 9 k= d— № 


1 16 
K? = а 1 ja + 5 — kak, cos Os, k? = laor (3) و‎ D kakp cos 0, (ба) 


©, being the scattering angle in the с. m. system. 


The integral over ọ is 


со 
-> ج‎ 4 . 
f doe iK? y*(o) Va (0) = X f doo V (0) ү(о) sin Ко 
0 


| use?) 
_ 227. (sin (ei Kal(x + К) — sin (x — Kalle —K)} = € (К). 


7. Dabrowski and J. Sawicki 


146 
On expanding e` ™ in a series of spherical waves: 


bus со 4f DA k n 
аст 9а № (iy (21 + 1% YOP)" a (8) 


110 
and taking in the expansion the only term giving a nonvanishing contribution to the 
integral (with J = 1), we have 


[d.R 69 e Yr = —i? V3x I, (E) ôm,o 


I, (k) -4 [| dr, rl В (ry) Л, (Era Vern (9a) 
0 


is identical with the integral calculated by Uberall (1953, Eq. (13)) in the theory of 


the a, of Be? at large energies: 


sin k i 
LE) = = = В sin Вто Е "0 _ cos 2 — k sin kro (52 "0 __ cos pr) | 
0 0 


Ма sin kro р 1 

Pro С (m — cos 2 — k sin Ато (1 + ij : (9b)‏ سا 
РУТ Та 1‏ 
g Ми Шр za Wd ly ™ Hu‏ 3 


we have as a final result: 


do, I i * * ka 2 Ê 
do. _ (2) мам; (де) зя wenn (11) 


The numerical results, their comparison with experimental data and the discus- 


where 


Since: 


уз 


mij y Ud 
ug m] 


sion of the beryllium well parameters will be the subject of the forthcoming part П. 


КРАТКОЕ СОДЕРЖАНИЕ 


Домбровский и Савицкий, Угловая дистрибуция Jm OH 
Bc (p,d) *Be. 1 HHOYS ронов из реакции 


-Исследовано процесс pickup” 9Ве(р, а)8Ве. Вызислено дифференциальное 
эффективное сечение при употреблении приближения Борна. При вычислениях 
употреблялись точные волновые функции берилия при положении. оболочечной 
модели ядра 9Be, противоположно к упрощающим положениям употребляемых 
Батеия и другими. Полученная окончательная формула полезная для числовых 
вычислений. i 
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LETTERS TO THE EDITOR 


ON THE PROBLEM OF SYSTEMATIZATION 
OF HEAVY MESONS AND HYPERONS 


JERZY. Rayskı 
Department of Physics, Nicholas Copernicus University, Torun, Poland 


(Received October 15, 1954; revised April 9, 1955) 


We assume tentatively that heavy particles may be clasified according to spin 
and parity (with respect to reflections of space exes x, y, 2). The л — meson possesses 
spin 0 and a minus parity (pseudoscalar), the next member of the pion family (the 
т — meson) should possess spin 1 and a plus parity (pseudovector), etc. According 
to the parity of the first member of the family we denote the pion family as the “(—) 
boson family“. There should exist another family: the “(--) boson family“ whose 
first member (with spin 0) possesses a plus parity (scalar). Te individual parity alter- 
nates in in each family and is denoted by + or —. We introduce also the notion: 
“principal parity“ denoted by (+) or (—) (always in a bracket). The principal parity 
is the name of the family and, at the same time, it is the (indiviudal) parity of the 
first member of the family | 7 


(—) boson family (+) boson family 


(tensor) : 


(pseudotensor) 


4 mass 
parity (theoret.) 
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In the case of fermions the situation is similar. There should exist two families 
of fermions with opposite relative parities, + and +. One of the two families should, 
of course, be the nucleon family. We may distinguish the two families by calling them 


arbitrarily 


(+) fermion family (—) fermion family 
mass 


name (theoret.) 
NN? 1836* 


* Adjusted 


In consequence of the laws of conservation: (i) of angular momentum, and (ii) 
of parity, there exist selection rules prohibiting the decays: tt > лё + m" 
and Y-— N° + л- so that the т mesons have to decay into three pions, while Y- 
must be a cascade particle. 

Denoting the spinor wave functions of the nucleon family (i.e. of the (+) fermion 
family) by yr), the other by Y» and denoting the wave functions of the two boson 
families by 94) and фс) we may introduce the coupling between the fields, schem- 
atically denoted by 


gük wavu ve: bj = + 5 g(ük) = eh). (1) 
Thus, we have six coupling constants at our disposal. It will be sufficient, however, 
to distinguish only two of them by assuming 


)2( 6= + و 9—-270- )8—9 

(+ =e(+ +H ==(+——)-6 (3) 

С Эа. (4) 

3 pA T F A TOEN N 
EN № 
ү : У 

c. Ax f i" н ! 

f 2 3 4 5 6 


The elementary processes may be visualized by means of the following graphs. 
From graphs 1,2 it is seen that the life time of 4 is of the same order as the life 

time of Y~ and, on account of a small value of g, may be comparatively long. Graphs 

3,4 show that a simultaneous production of a A together with 0 is more probable than 
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the production of a A with z or v. Graph 3 shows also that the production of A’s 
is copious inspite of their long life time. Graph 5 shows that nuclear forces between 


the nucleon and A are of the same order of magnitude as the usual nuclear forces be- 
tween two neutrons whic 


graph 
6 shows that the mesons belonging to the (+) boson family are cold (practically do not 


participate in the nuclear forces) which accounts for their rarneess in comparison 
with pions. 


The theoretical mass values have been derived from the bilocal field theory 
(Rayski 1955) which yields the following formulae 
for bosons (with spin 1) or fermions with spin / +4 


4 SA a ze! 2 
М, -|/ 4@++% О (5) 


for fermions (with spin j) 


ET x? 1 3 
M -V (+3) +5 +531 = DIESE 49) 


The constant x may be simply abandoned in comparison with а except in the case? = 0. 
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AN ANALYSIS OF A A°-PARTICLE DECAY IN FLIGHT 
RECORDED IN NUCLEAR EMULSION 


Ву Р. Сток AND A. STEGNER, 
Institute of Physics, University of Warsaw, Warsaw 


(Received October 9, 1954) 


In Ilford G5 nuclear emulsion 600 y thick exposed at an altitude of about 25 000m 
during 6. 5 hours a two prong star was found the photomicrograph of which is shown 
in tab. 1. 

Two tracks, marked a and 5, start from point S, no trace of recoil being visible. 
Track a ends in the emulsion with a small one prong star (one black track and a short 
recoil), track b after 13 000 runs into the glass. 

A preliminary analysis of this event (Danysz 1953) has shown that: 

1. both tracks a and b are due to singly charged particles the masses of which are 
respectively: few hundred electron masses and (1700 + 140) m,. 

2. both particles start from point S. 

Moreover assuming particle а to be a 2 meson, particle b — а proton and 
taking the whole event as а A°-particle decay in flight one obtains for the Q of the 
decay the value: (35.5 + 2) MeV. 

As the event discussed offers rather good conditions for the determination of 
the О value of the A°-particle a more detailed analysis of it was done. All measure- 
ments were performed using a Cooke, Troughton & Simms M40121 microscope and 
were made independently by two observers. The shrinkage factor for the emulsion 
was determined from the analysis of two elastic colisions (р-р and d-p) with an esti- 
mated accuracy of + 3%. Displacements due to the distortion were evaluated with 
an accuracy of 10% in length and 3° in direction from an analysis of strongly deeping 
tracks in the vicinity of the event. The momentum of the x meson was determined 
from its range using the Friedlander et al. (1954) range-energy relation. The error in the 
determination of energy due to different humidity conditions during the exposure was 
assumed not to exceed 2%. The momentum of thé proton was determined from 
the mean grain density in its track, taking into account the necessary correction 
related to energy losses in the first half of its track. The relation p f (g*) was calibra- 
ted on several long proton tracks of similar grain density with an estimated accuracy 
of + 3.5%. š i ' 
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Both in the measurements of the л-шезоп range and in the measurements of the 
angle between the proton and meson tracks the influence of distortion was taken 
into account. 


The final results of these measurements are as follows: 


Pa = (37.4 + 0.7) MeV/c 
рь = (545 + 17) MeV/c 
gy, = 128° 40° + 1°45’ 
From these data one obtains for the energy of A°-decay the value 
О = (35.2 + 1.2) MeV, 
assuming m, = 272.5 m, = 139.24 MeV (Smith, Birnbaum and Barkas 1953) 


m, = 1836, 13 m, = 938. 23 MeV. 
These results might be compared with Friedlander's et. al. (О = 36.92 + 0.22) 
and van Lint's et. al. (1954) (О = 34.7 + 1) data. 


In evaluating the accuracy of the Q-value determination the influence of the 
following factors was taken into account: | 


(1). Uncertainty in the determination of the рр (g*) relation for protons: 


МЕУ PUT 
РВ) A (pf) = 0.95. 


(2). Errors in g* determination (statistical only): 
© 


EI A 
dr 48 = 0.39 


(3). Uncertainty in the determination of the average energy loss on the first 
half of the proton track (assumed to be not more than 20%): 


к A ae 0.28 MeV. 
9 E E dR 
dR 
(4). Uncertainty in the determination of the range-energy relation and of humidity 


conditions: 


9 
78-4 Е, = 0.22 MeV. 
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(5). Uncertainty in the z--meson energy determination (straggling): 
20 
ФЕ, ит) 
(based on Fry and White’s results (1953) for the џ-тевопз). 


(6). Uncertainty in the angle ф determination (depth measurement and scatter- 
ing of the 2~meson track): 


A Ене) = 0.23 MeV. 


20 
m Ag = 0.38 MeV. 
(7). Uncertainty in the distortion and shrinkage factor determination: 


90 
8 (dis) A (dis) = 0.03 MeV. 


© 


В 
zs 4 S= 0.01 MeV. 


where S is the shrinkage factor. 

Other considered sources of error as uncertainty of direct length, depth and 
angle measurements had proved negligible. 

We wish to thank Professor M. Danysz for his guidance and helpful discussions. 
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Acta Physica Polonica, 14, 135 (1954): 
Disintegration of the *Be Nucleus in Coulomb Field by J. Sawicki 


Recently W. Czyz (Phys. Rev., 102, 1185 (1956)) has noticed that Guth and Mullin (Phys. Rev., 76, 

254 (1949)) erroneously omitted the interference terms coming from the different P —> S and P —D 
transitions in the ?Be (у, n) angular distribution. Here the interference terms in the angular distribution 
| were also omitted after Guth and Mullin. | 
| On including these terms the following corrections should be made: Eq. (13), p. 138, should read ag 
| t 
| 2 Е 1 
| 


#1 ^ 
LL D) fep (2) = gr д @ +3 092) | Кәрі? | (13) 


Eq. (17), р. 140, should read 


с BP y à 
dg = 2 2729 [ Rps |? + | Rpp|? + 2 Rps Rpp cos (O, — д) In Г + 


ps (i Rpp|* — Rps Rpp cos (б — 2 s (8 cos? 0,— 1) (1 — I7?) + sin? 0, In r)| (17) 


D. Since the interference terms vanish after integration over (2, the cross-section — do]d E, (Eq. (18)) 
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